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IMMUNOTOXINS COMPRISING 6EL0NIN AND AN ANTIBODY 



—F-X^ELD OF THE INVENTION 
This application is a continuation-in-part of co- 
pending U.S. Patent Application Serial No. 08/064,691 filed 
5 May 12, 1993, which is a continuation-in-part of co-pending 
U.S. Patent Application Serial No. 07/988,430 filed 
December 9, 1992, which is in turn a continuation-in-part 
of pending U.S. Patent Application Serial No. 07/901,707 
filed June 19, 1992, which in turn is a continuation-in- 

10 part of U.S. Patent Application Serial No. 07/787,567 filed 
November 4, 1991, now abandoned. 

The present invention genially relates to 
materials useful as components of cytotoxic therapeutic 
agents. More particularly, the invention relates to 

15 ribosome-inactivating proteins, to analogs of ribosome- 
inactivating proteins, to polynucleotides encoding such 
proteins and analogs, some of which are specifically 
modified for conjugation to targeting molecules, and to 
gene fusions of polynucleotides encoding ribosome- 

20 inactivating proteins to polynucleotides encoding targeting 

molecules. 

BACKGROUND 

Ribosome-inactivating proteins (RIPs) comprise a 
class of proteins which is ubiquitous in higher plants. 

25 However, such proteins have also been isolated from 

bacteria. RIPs are potent inhibitors of eukaryotic protein 
synthesis. The N-glycosidic bond of a specific adenine 
base is hydrolytically cleaved by RIPs in a highly 
conserved loop region of the 28S rRNA of eukaryotic 

30 ribosomes thereby inactivating translation. 

Plant RIPs have been divided into two types. 
Stirpe et a2., FEBS Lett.^ 195 (1,^2) tl-B (198 6). Type I 
proteins each consist of a single peptide chain having 
ribosome-inactivating activity, while Type II proteins each 

35 consist of an A-chain, essentially equivalent to a Type I 
protein, disulf ide-linXed to a B-chain having cell-binding 
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properties. Gelonin, dodecandrin, tricosanthin, 

tricokirin, bryodin, Mirabilis antiviral protein (MAP), 
bariey ribosome-inactivating protein (BRIP) , pokeweed 
antiviral proteins (PAPs) , saporins, luff ins, and momordins 
5 are examples of Type I RIPs; whereas Ricin and abrin are 
examples of Type II RIPs, 

Amino acid sequence information is reported . for 
various ribosome- inactivating proteins. It appears that, at 
-least the tertiary structure of RIP active sites is 

10 conserved among Type I RIPs, bacterial RIPs, and A-chains 

of Type II RIPS. In many cases, primary structure homologry 
is also found. Ready et al., J. Siol. Chem.^ 

259 f24; : 15252-15256 (1984) and other reports suggest that 
the two types of RIPs are evolutionarily related. 

X5 Type I plant ribosome-inactivating. proteins may 

be particularly suited for use as components of cytotoxic 
therapeutic agents. A RIP may be conjugated to a targeting 
agent which will deliver the RIP to a particular cell type 
in vivo in order to selectively kill those cells. 

20 Typically, the targeting agent (e.g., an antibody) is 

linked to the toxin by a disulfide bpnd which is reduced in 
vivo allowing the protein toxin to separate from the 
delivery " antibody and become active intracellular ly. 
Another strategy for producing targeted cytotoxic proteins 

25 is to express a gene encoding a cytotoxic protein fused to 

a gene encoding a targeting moiety. The resulting protein 
product is composed of one or more polypeptides containing! 
the cytotoxic protein linked to, for example, at least one 
chain of an antibody. 

30 A variety of such gene fusions are discussed in 

Pastan et al.. Science, 254:1173-1177 (1991). However, 
these fusion proteins have been constructed with sequences 
from diphtheria toxin or Pseudomonas aeruginosa exotoxin A, 
both of which are ADP-ribosyltransf erases of bacterial 

35 origin. These protein toxins are reported to intoxicate 

cells and inhibit protein synthesis by mechanisms which 
differ from those of the RIPs. Moreover, diphtheria toxin 



M'O 94/26530 



PCT/DS94/05348 



-3- 

and exotoxin A are structurally different from, and show 
little amino acid sequence similarity with, RIPsf. In 
general, fusion proteins made with diphtheria toxin or 
exotoxin A have been immunogenic and toxic in animals, and 
are produced intracellular ly in relatively low yield. 
Another strategy for producing a cytotoxic agent is to 
express a gene encoding a RIP fused to a gene encoding a 
targeting moiety. The resulting protein product is a 
single polypeptide containing a RIP linked to, for example, 
at least one chain of an antibody. 

Because some RIPs, such as the Type I Rip 
gelonin, are primarily available from scarce plant 
materials, it is desirable to clone the genes encoding the 
RIPs to enable recombinant production of the proteins. It 
is also desirable to develop analogs of the natural 
proteins which may be easily conjugated to targeting 
molecules while retaining their natural biological activity 
because most Type I RIPs have no natural sites (i.e» 
available cysteine residues) for conjugation to targeting 
agents. Alternatively, it is desirable to develop gene 
fusion products including Type I RIPs as a toxic moiety and 
antibody substances as a targeting moiety. 

The present invention also provides. . novel 
humanized or human-engineered antibodies and methods for 
producing such antibodies which may be conjugated or fused 
to various toxins. Such conjugations or fusions are useful 
in the treatment of various disease states, including 
autoimmune diseases and cancer. 

There are several reports relating to replacement 
of amino acids in a mouse antibody with amino acids 
normally occurring at the analogous position in the human 
form of the antibody. See, e.g., Junghaus, et al.. Cancer 
Pes., 50: 1495-1502 (1990) and other publications which 
describe genetically-engineered mouse/human chimeric 
antibodies. Also by genetic engineering techniques, the 
genetic information from murine hypervariable 
complementarity determining regions (hereinafter referred 
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to as "CDRs") may be inserted in place of the DNA encoding 
the CDRs of a human monoclonal antibody to generate a 
construct encoding a human antibody with murine CDRs. See, 
e.g., Jones, et aJ., Nature^ 321: 522-525 (1986) • 
5 Protein structure analysis on such "CDR-grafted»» 

antibodies may be used to "add back" murine residues in 
order to restore lost antigen-binding capability, as 
described in Queen, et al, Proc. Natl. Acad. Sex* (USA), 
85:10029-10033 (1989); Co, et al., Proc. Nat. Acad, Sci. 

XO (USA), 88: 2869-2873 (1991). However, a frequent result of 

CDR-grafting is that the specific binding acitvity of the 
resulting humanized antibodies may *be diminished or 
completely abolished. 

As demonstrated by the foregoing, there exists a 

15 need in the art for cloned genes encoding Type I RIPs, for 

analogs of Type I RIPs which may be easily conjugated to 
targeting molecules, and for gene fusion products 
comprising Type I RIPs, and especially wherein such gene 
fusions also comprise an humanized antibody portion. 

20 SUMMARY OF THE INVENTIOK 

The present invention provides purified and 
isolated polynucleotides encoding Type I RIPs, Type I RIPs 
having a cysteine available for disulfide bonding to 
targeting molecules and fusion products comprising Type I 

25 RIPs. Vectors comprising the polynucleotides and host 

cells transformed with the vectors are also provided. 

A purified and isolated polynucleotide encoding 
natural sequence gelonin (SEQ ID NO: 11) , and a host cell 
including a vector encoding gelonin of the type deposited 

3 0 as ATCC Accession No. 68721 are provided. Further provided 

are a purified and isolated polynucleotide encoding natural 
sequence barley ribosome-inactivating protein and a 
purified and isolated polynucleotide encoding momordin II. 

Some of the polynucleotides mentioned above 

35 encode fusion proteins of the present invention comprising 

gelonin (SEQ ID NO: 2) or another RIP and an antibody or a 
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fragment comprising an antigen-binding portion thereof. 
Several alternate forms of fusion proteins comprising 
gelonin are contemplated herein. For example, the fusion 
protein may contain a single RIP fused to a monovalent 
5 antibody binding moiety, such as a Fab or single chain 

antibody. Alternatively, multivalent forms of the fusion 
proteins may be made and may have greater activity than the 
monovalent forms • In preferred embodiments of the 
invention, gelonin may be fused to either the carboxy or 

10 • the amino terminus of the antibody or antigen-binding 
portion of thereof. Also in a preferred embodiment of the 
invention, the antibody or fragment thereof comprising an 
antigen-binding portion may be an he3\ntibody, an heS-Fab, 
an he3 Fd, single-chain antibody, or an he3 kappa fragment. 

15 The antibody or antigen-binding portion thereof may be 
fused to gelonin by means of a linker peptide, preferably 
a peptide segment of shiga-like toxin as shown in SEQ ID 
NO: 56 or a peptide segment of Rabbit muscle aldolase as 
shown in SEQ ID NO: 57 or a human muscle aldolase^ an 

20 example of which is reported in Izzo, et al. , Eur. J. 

Biochem, 174i 569-578 (1988), incorporated by reference 
herein. 

T^alogs of a Type I plant RIP are defined herein 
as non-naturally occurring polypeptides that share the 

25 ribosome- inactivating activity of the natural protein but 

that differ in amino acid sequence from the natural type 1 
RIP protein in some degree but less than they differ from 
the amino acid sequences of other Type I plant RIP. 
Preferred analogs according to the present invention are 

30 analogs of Type I plant RIPs each having a cysteine 
available for disulfide bonding located at a position in 
^ its amino acid sequence from the position corresponding to 
position 251 in SEQ ID NO: 1 to the carboxy 1 terminal 
position of the analog. SEQ ID NO: i represents the amino 

35 acid sequence of ricin A-chain. Other preferred analogs 
according to the invention are Type I RIPs each having a 
cysteine available for disulfide bonding at a position in 
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the analog that is on the surface of the protein in its 
natural conformation and that does not impair native 
folding or biological activity of the ribosome-inactivating 
protein • Analogs of bacterial RIPs are also contemplated 
5 by the present invention. 

The present invention provides an analog of a 
Type I ribosome-inactivating protein, which analog has a 
cysteine available for intermolecular disulfide bonding at 
* an amino acid position corresponding to a position not 

10 naturally available for intermolecular disulfide bonding in 

the Type I ribosome-inactivating protein and which cysteine 
is located at a position in the amino*^acid sequence of the 
analog corresponding to position 259 in SEQ ID NO: 1 or at 
a position in the amino acid sequence in the analog 

15 corresponding to a position from position 251 in SEQ ID NO: j 

1 to the carboxyl terminal position of the analog. 

An analog according to the present invention may 
be an analog of gelonin. In an analog of gelonin according 
to the present invention, the cysteine may be at a position 

20 in the analog from position 24 4 to the carboxyl terminal 

position of the analog, more preferably at a position in 
the analog from position 247 to the cairboxyl terminal 
position of the analog, and most, preferably at position 
244, at position 247, or at position 248 of the amino acid 

25 sequence of the analog. In these analogs, it is preferred 

that the gelonin cysteine residues at positions 44 and 50 
be replaced with non-cysteine residues such as alanine. 

An analog according to the present invention may 
be an analog of barley ribosome-inactivating protein. 

30 Preferably, a cysteine in such an analog is at a position 

in the analog from position 256 to the carboxyl terminal 
position, and more preferably the cysteine is at a position 
in the amino acid sequence of the analog from position 260 
to the carboxyl terminal position of the analog. Most 

35 preferably, in these regions, the cysteine is at position 

256, at position 270, or at position 277 of the amino acid 
sequence of the analog. 
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An analog according to the present invention may 
be an analog of momordin II. 

Analogs according to the present invention may 
have a cysteine in the amino acid sequence of the analog at 
5 a position which corresponds to a position within one amino 

acid of position 259 of SEQ ID NO: 1. Such an analog may 
be an analog of gelonin, of barley ribosome-inactivating 
protein, or of momordin II, 

The present invention also provides a 

10 polynucleotide encoding an analog of a Type I ribosome- 
inactivating protein, which analog has a cysteine available 
for intermolecular disulfide bonding at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 

15 I ribosome-inactivating protein, and which cysteine is 

located at a position in the amino acid sequence of the 
analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
analog. The polynucleotide may encode an analog of 

2 0 gelonin, preferably an analog wherein the cysteine is at a 

position in the amino acid sequence of the analog from 
position 244 to the carboxyl terminal position of the 
analog, more preferably wherein the cysteine is at a 
position in the analog from position 247 to the carboxyl 

25 terminal position of the analog, and most preferably the 

cysteine is at position 244, at position 247 or at position 
24 8 of the amino acid sequence of the analog. It. is 
preferred that a polynucleotide according to the present 
invention encode a gelonin analog wherein the native 

30 gelonin cysteine residues at positions 44 and 50 are 

replaced with non-cysteine residues, such as alanine. 

A polynucleotide according to the present 
invention may encode an analog of barley ribosome- 
inactivating protein, preferably an analog wherein the 

35 cysteine is at a position in the analog from position 256 

to the carboxyl terminal position of the analog, more 
preferably wherein the cysteine is at a position in the 
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analog from position 2 60 to the carboxyl terminal position 
of the analog, and most preferably wherein the cysteine is 

at"^sit:it)ir 256, at position 2-TO or— at— position 277 of the 

amino acid sequence of the analog. 
5 A polynucleotide according to the present 

invention may encode an analog of mormordin II. 

The present invention provides a vector including 
a polynucleotide encoding an analog of a Type I ribosome- 
inactivating protein, which analog has a cysteine available 

10. for intermolecular disulfide bonding at a amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and which cysteine is 
located at a position in the amino acid sequence of the 

15 analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
analog* 

The present invention further provides a host 
cell including a DNA vector encoding an analog of a Type I 

20 ribosome-inactivating protein, which analog has a cysteine 

available for intermolecular disulfide bonding at an amino 
acid position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and which cysteine is 

25 located at a position in the amino acid sequence of the 

analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
analog. In such a host cell the vector may encode an 
analog of gelonin, especially an analog wherein the 

30 cysteine is at position 247 of the amino acid sequence of 
the analog, such as in the host cell deposited as ATCC 
Accession No. 69009. 

A host cell according to the present invention 
may include a vector encoding barley ribosome-inactivating 

35 protein, especially preferred is a host cell containing a 
BRIP analog wherein the cysteine is at position 277, such 
as in the host cell deposited on October 2, 1991 with the 
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10 



American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, MD 20852 as . ATCC Accession No. 68722.' 
Particularly preferred are prokaryotic host cells because 
such cells may be less sensitive to the action or RIPs as 
compared to e\ikaryotic cells. 

The present invention also provides an agent 
toxic to a cell including an analog of a Type I ribosome- 
inactivating protein linked by a disulfide bond through a 
cysteine to a molecule which specifically binds to the 
cell, which cysteine is at an amino acid position in the 
analog corresponding to a position not naturally available 
for intermolecular disulfide bonding in the Type I 
ribosome- inactivating protein and which cysteine is located 
in the amino acid sequence of the analog from the position 
15 corresponding to position 251 in SEQ id NO: 1 to the 
carboxyl terminal position of the analog. The agent may 
include an analog of gelonin, preferably an analog wherein 
the cysteine is at a position in the analog from position 

247 to the carboxyl terminal position of the analog, and 
more preferably wherein the cysteine is at position 247 or 

248 of the amino acid sequence of analog. An agent 
including an analog wherein the native gelonin cysteine 
residues at positions 4 4 and 50 are replaced with non- 
cysteine residues, such as alanine is preferred. 

An agent according to the present invention may 
include an analog of barley ribosome-inactivating protein, 
preferably an analog wherein the cysteine is at a position 
in the analog from position 260 to the carboxyl terminal 
position of the analog, more preferably wherein the 
cysteine is at a position in the analog from position 270 
to the carboxyl terminal position of the analog, and most 
preferably wherein the cysteine is at position 256, at 
position 270 or at position 277 of the amino acid sequence 
of the analog. 

An agent according to the present invention may 
include an analog of momordin II. 



20 



25 
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The present invention provides an agent wherein 
the Type I ribosome-inactivating protein is linked to an 

B7rtlbDdy-r'particul-ariy--tor-anr-He5 antdrb^ or to an antibody 

fragment, more particularly to an antibody fragment 
5 selected from the group consisting of chimeric and human 

engineered antibody fragments, and most particularly to a 
Fab antibody fragment, a Fab' antibody fragment or a F(ab*)2 
antibody fragment. It is highly preferred that an agent 
* according to the present invention include a chimeric or 

10 human engineered antibody fragment selected from the group 

consisting of a Fab antibody fragment, a Fab' antibody 
fragment and a F(ab')2 antibody fragment- 

A method according to the present invention for 
preparing an analog of a Type I ribosome-inactivating 

15 protein includes the step of expressing in a suitable host 

cell a polynucleotide encoding a Type I ribosome- 
inactivating fusion protein or type I RIP (especially 
gelonin) having a cysteine available for intermolecular 
disulfide bonding substituted (e.g., by site-directed 

20 mutagenesis of the natural DNA sequence encoding the RIP or 

by chemical synthesis of a DNA sequence encoding the RIP 
analog) at an amino acid position corresponding to a 
position not naturally available for intermolecular 
disulfide bonding in the Type I ribosome-inactivating 

25 protein, which cysteine is located at a position in the 
amino acid sequence of the analog from the position 
corresponding to position 251 in SEQ ID NO: 1 to the 
carboxyl terminal position of the analog. 

A product according to the present invention may 

30 be a product of a method including the step of expressing 

in a suitable host cell a polynucleotide encoding a Type I 
ribosome-inactivating protein having a cysteine available 
for intermolecular disulfide bonding substituted at an 
amino acid position corresponding to a position not 

35 naturally available for intermolecular disulfide bonding in 

the Type I ribosome-inactivating protein, which cysteine, is 
located at a position in the amino acid sequence of the 
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20 



25 



35 



analog from the position corresponding to position 251 in 

..^ H'^.^^I^®^^ position of the 

analog. 

The present invention provides a method for 
preparing an agent toxic to a cell including the step of 
linXing an analog of a Type 1 ribosome- inactivating protein 
through a cysteine to a molecule which specifically binds 
to the cell, which analog has the cysteine at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome- inactivating protein and which cysteine is 
located at a position in the amino ^cid sequence of the 
analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
15 analog. 

According to the present invention, a method for 
treating a disease in which elimination of particular cells 
is a goal may include the step of administering to a 
patient having the disease a therapeutically effective 
amount of an agent toxic to the cells including a type I 
RIP (especially gelonin fused to or an analog of a Type I 
- ribosome- inactivating protein linked through a cysteine to 
a molecule which specifically binds to the cell, the analog 
having the cysteine at an amino acid position corresponding 
to a position not naturally available for intermolecular 
disulfide bonding in the Type I ribosome-inactivating 
protein and the cysteine being located at a position in the 
amino acid sequence of the analog from the position 
corresponding to position 251 in SEQ id NO: i to the 
3 0 carboxyl terminal position of the analog. 

Useful target cells for inmunotoxins of the 
present invention include, but are not limited to, cells 
which are pathogenic, such as cancer cells, autoimmune 
cells, and virally-infected cells. Such pathogenic cells 
may be targeted by antibodies or other targeting agents of 
the invention which are joined, either by genetic 
engineering techniques or by chemical cross-linking, to an 
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RIP. Specifically useful targets include tumor-associated 
antigens (e.g., on cancer cells), cell differentiation 
Barkers {e.g., on autoimmune cells), parasite-specific 
antigens, bacteria-specific antigens, and virus-specific 
5 antigens. 

The present invention also provides an analog of 
a Type I ribosome-inactivating protein, wherein the analog 
has a cysteine available for intennolecular disulfide 
bonding located at an amino acid position corresponding to 

10 a position not naturally available for intermolecular 

disulfide bonding in the Type I ribosome-inactivating 
protein and corresponding to a positJbn on the surface of 
ricin A-chain in its natural conformation, and wherein the 
analog retains the ribosome-inactivating activity of the 

15 Type I ribosome-inactivating protein. 

Such a fusion protein or an analog may be a 
fusion protein or an analog wherein the Type I ribosome 
inactivating protein is gelonin, and the analog is 
preferably an analog of gelonin wherein the cysteine is at 

20 position 10 of the amino acid sequence of the analog as 

encoded in a vector in a host cell deposited with the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, MD 20852 as ATCC Accession No. 69008 on June 9, 
1992. Other such gelonin analogs include those wherein the 

25 cysteine is at a position 60, 103, 146, 184 or 215 in the 

amino acid sequence of the gelonin analog. It is preferred 
that the gelonin cysteine residues at positions 44 and 50 
be replaced with non-cysteine residues such as alanine in 
these analogs. 

30 The present invention further provides an analog 

of a Type I ribosome-inactivating protein wherein the 
analog includes only a single cysteine. Such an analog may 
be an analog of gelonin and is preferably an analog wherein 
the single cysteine is at position 10, position 44, 

35 position 50 or position 247 in the amino acid sequence of 

the analog, but the cysteine may be located at other 
positions defined by the invention as well. 
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The present invention provides a polynucleotide 
encodin g an an alog of a Type I ribosome-inactivating 
protein, wherein the anaTog has ~a~cysteine available "for 
intermolecular disulfide bonding located at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome- inactivating protein and corresponding to a 
position on the surface of ricin A-chain in its natural 
conformation, and wherein the analog retains ribosome- 
inactivating activity of the Type I ribosome-inactivating 
protein-. 

According to the present invention, a method for 
preparing an analog of a Type I ribosome-inactivating 
protein may include the step of expressing in suitable host 
15 cell a polynucleotide encoding a Type l ribosome- 

inactivating protein having a cysteine available for 
intermolecular disulfide bonding substituted at an amino 
acid position corresponding to a position not naturally 
available for disulfide bonding in the Type I ribosome- 
inactivating protein, the cysteine is located at a position 
corresponding to an amino acid position on the surface of 
ricin A-chain in its natural conformation and which analog 
retains ribosome-inactivating activity of the Type I 
ribosome-inactivating protein. 

The present invention provides an agent toxic to 
a cell including an analog of a Type I ribosome- 
inactivating protein linked by a disulfide bond through a 
cysteine to a molecule which specifically binds to the 
cell, wherein the analog has a cysteine available for 
30 intermolecular disulfide bonding located at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and corresponding to a 
position on the surface of ricin A-chain in its natural 
35 conformation, and wherein the analog retains ribosome- 
inactivating activity of the Type I ribosome-inactivating 
protein. 
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A inethod according to the present invention for 
preparing an agent toxic to a cell may inclixde the step of 
linking an analog of a Type I ribosome-inactivating protein 
through a cysteine to a molecule which specifically binds 
5 to the cell, which analog has a cysteine available for 

intermolecular disulfide bonding located at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
* I ribosome-inactivating protein and corresponding to a 
10 position on the surface of ricin A-chain in its natural 
conformation, and which analog retains ribosome- 
inactivating activity of the Type I 'J-ibosome-inactivating 
protein. 

A method according to the present invention for 

15 treating a disease in which elimination of particular cells 

is a goal includes the step of administering to a patient 
having the disease a therapeutically effective amount of an 
agent toxic to the cells wherein the agent includes a type 
I RIP fused to or an analog of a Type I ribosome- 

20 inactivating protein linked by a disulfide bond through a 

cysteine to a molecule which specifically binds to the 
cell, which analog has a cysteine available for 
intermolecular disulfide bonding located at an amino acid 
position corresponding to a position not naturally 

25 available for intermolecular disulfide bonding in the Type 
I ribosome- inactivating protein and corresponding to a 
position on the surface of ricin A~chain in its natural 
conformation, and which analog retains ribosome-^ 
inactivating activity of the Type I ribosome- inactivating 

30 protein. 

The RIP analogs of the invention are particularly 
suited for use as components of cytotoxic therapeutic 
agents. Cytotoxic agents according to the present 
invention may be used in vivo to selectively eliminate any 
35 cell type to which the RIP component is targeted by the 

specific binding capacity of the second component. To form 
cytotoxic agents, RIP analogs may be conjugated to 
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monoclonal antibodies, including chimeric and CDR-grafted 
antibodies, and antibody domains/ fragments (e.g.. Fab, 
Fab», F(ab»)2, single chain antibodies, and Fv or single 
variable domains) • Analogs of RiPs conjugated to 
5 monoclonal antibodies genetically engineered to include 
free cysteine residues are also within the scope of the 
present invention. Examples of Fab» and F(ab»)2 fragments 
useful in the present invention are described in co- 
pending, co-owned U.S. Patent Application Serial No. 
10 07/714,175, filed June 14, 1991 and in International 

Publication No. WO 89/00999 published on February 9, 1989, 
which are incorporated by reference herein i 

The RIP analogs of the invention may preferably 
be conjugated or fused to humanized or human engineered 
15 antibodies, such as the he3 antibody described herein. 

Such humanized antibodies may be constructed from mouse 
antibody variable domains, such as the mouse antibody H65 
(SEQ ID NOS: 123 and 124). Specifically RIP analogs 
according to the present invention may be conjugated or 
fused to he3 human-engineered antibody light and heavy 
chain variable regions (SEQ ID NO: 125 and 126, 
respectively) . or fragments thereof, A cell line producing 
an intact he3 immunoglobulin was deposited as ATCC 
Accession No. HB11206 with the American Type Culture 
25 Collection, 12301 Parklawn Drive, Rockville, Maryland 

20852. 

RIPS according to the present invention may also 
be conjugated to targeting agents other than antibodies, 
for example, lectins which bind to cells having particular 

30 surface carbohydrates, hormones, lymphokines, growth 
factors or other polypeptides which bind specifically to 
cells having particular receptors. Immunoconjugates 
including RIPs may be described as immunotoxins . An 
immunotoxin may also consist of a fusion protein rather 

35 than an immunocon jugate. 

The present invention provides gene fusions of an 
antigen-binding portion of an antibody (e.g., an antibody 
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light chain or truncated heavy chain, or a single chain 
antibody) or any targeting agent listed in the foregoing 

-paragraph,- linked- -to — a — ^T-ype -I — RIP. P-referably , the 

antigen-binding portion of an antibody or fragment thereof 
5 comprises a single chain antibody, a Fab fragment, or a 
F(ab')2 fragment. Active antibodies generally have a 
conserved three-dimensional folding pattern and it is 
expected that any antibody which maintains that folding 
• pattern will retain binding specificity. Such antibodies 
10 are expected to retain target enzymatic activity when 

incorporated into a fusion protein according to the present 
invention. 

It is sometimes necessary that immunotoxins 
comprising cytotoxic components , such as RIPs, be attached 

15 to targeting agents via cleavable linkers (i.e.^ 

disulfides, acid-sensitive linkers, and the like) in order 
to allow intracellular release of the cytotoxic component. 
Such intracellular release allows the cytotoxic component 
to function unhindered by possible negative kinetic or 

20 steric effects of the attached antibody. Accordingly, gene 

fusions of the present - invention may comprise a RIP gene 
fused, via a DNA segment encoding a linker protein as 
described above, to either the 5' or the 3' end of a gene 
encoding an antibody. If a linker is used, it preferably 

25 encodes a polypeptide which contains two cysteine residues 

participating in a disulfide bond and forming a loop which 
includes a protease-sensitive amino acid sequence (e«g., a 
segment of E. coli shiga-like toxin as in SEQ ID NO: 56) or 
a segment which contains several cathepsin cleavage sites 

30 a segment of rabbit muscle aldolase as in SEQ ID NO: 

57; a segment of human muscle aldolase; or a synthetic 
peptide including a cathepsin cleavage sites such as in SEQ 
ID NOs: 141 or 142) . A linker comprising cathepsin 
cleavage sites as exemplified herein comprises the C- 

35 terminal 20 amino acids of RMA. However, that sequence 

differs by only one amino acid from human muscle aldolase 
and it is contemplated that muscle aldolase from human or 
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other sources may be used as a linker in the manner 
described below. The Type I RIP portion of the fused genes 
preferably encodes gelonin, BRIP or momordin II. Also 
preferably, the antibody portion of the fused genes 
5 comprises sequences encoding one of the chains of ah 

antibody Fab fragment (i.e., kappa or Fd) and the fused 
gene is co-expressed in a host cell with the other Fab 
gene, or the antibody portion comprises sequences encoding 
a single chain antibody. 

10 ' Alternatively, since fusion proteins of the 

present invention may be of low (approximately 55 kDa) 
molecular weight while maintaining full enzymatic activity, 
such fusions may be constructed without a linker and still 
possess cytotoxic activity. Such low-molecular weight 

15 fusions are not as susceptible to kinetic and steric 

hinderance as are the larger fusions, such as fusions 
involving IgG molecules. Therefore, cleavage of the RIP 
away from the fusion may not be necessary to facilitate 
activity of the RIP. 

20 The present invention also provides a method for 

purifying a protein or immunotoxin comprising a ribosome- 
inactivating protein and a portion of an antibody including 
the steps of passing a solution containing the protein 
through an anion exchange column; applying the flow-through 

25 to a protein G column; and eluting the protein from the 

protein G column. The method may further comprise the 
steps of introducing the flow-through of the anion exchange 
column into a cation exchange column; exposing the cation 
exchange column to an eluent effective to elute said 

30 protein; and then applying the eluted protein to a protein 

G column, rather than applying the anion exchange column 
flow- through directly to a protein G column. 

Immunotoxins according to the present invention, 
including immunoconjugates and fusion proteins 

35 (immunofusions) , are suited for treatment of diseases where 

the elimination of a particular cell type is a goal, such 
as autoimmune disease, cancer, and graf t-versus-host 
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disease. The immunotoxins are also suited for use in 
causing immunosuppression and in treatment of infections by 
. — viruses- such— as— the~-Human Immunodef-ie-ieney-Vi-rus. - 

Specifically illustrating polynucleotide 
5 sequences according to the present invention are the 

inserts in the plasmid pING3731 in E. coli MClOGi 
(designated strain G274) and in the plasmid pING3803 in JE:. 
coli Elb4 (designated strain G275) , both deposited with the 
- American Type Culture Collection (ATCC) , 12301 Parklawn 

10 Drive, Rockville, Maryland, on October 2, 199 1, and 

assigned ATCC Accession Nos. 68721 and 68722, respectively. 
Additional polynucleotide sequence^ illustrating the 
invention are the inserts in the plasmid pING3746 in E. 
coli E104 (designated strain G277) and in the plasmid 

15 pING3737 in E. coli E104 (designated strain G276) , which 

were both deposited with the ATCC on June 9, 1992, and were 
respectively assigned Accession Nos. 69008 and 69009. 
Still other polynucleotide sequences illustrating the 
invention are the inserts in the plasmid pING3747 in E. 

20 coli E104 (designated strain G278) , in the plasmid pING3754 

in E. coli E104 (designated strain G279), in the plasmid 
pING3758 in E. coli E104 (designated strain G280) and in 
the plasmid pING3759 in E. coli E104 (designated strain 
G281) , which plasmids were all deposited with the ATCC on 

25 October 27, 1992 and were assigned ATCC Accession Nos. 

69101, 69102, 69103 and 69104, respectively. 

As noted above, RIPs may preferably be conjugated 
or fused to humanized or human-engineered antibodies, such 
as he3. Thus, the present invention also provides novel 

30 proteins comprising an humanized antibody variable domain 
which is specifically reactive with an h\iman CDS cell 
differentiation marker. Specifically, the present 

invention provides proteins comprising the he3 light and 
heavy chain variable regions as shown in SEQ ID NOS: 95 or 

35 96, respectively. DNA encoding certain he3 proteins is 

shown in SEQ ID NOS: 67 and 68. 
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In a preferred embodiment of the present 
invention, the protein comprising an hximanized antibody 
variable region is an intact he3 immunoglobui in deposited 
as ATCC HB 11206. 
5 Also in a preferred embodiment of the invention, 

the protein comprising an hximanized antibody variable 
region is a Fab or F(ab')2 or Fab fragment. 

Proteins according to the present invention may 
be made by methods taught herein and in co-pending, co- 

10 ' owned U.S. patent application no. 07/808,4 64 by Studnicka 
et al. incorporated by reference herein; and modified 
antibody variable domains made by such methods may be used 
in therapeutic administration to hximans either alone or as 
part of an immunoconjugate as taught in co-owned, co- 

15 pending U.S. Patent Application No. 07/787,567 by Better et 
al. 

The present invention also provides methods for 
preparing a modified antibody variable domain useful in 
preparing immunotoxins and immunofusions by determining the 

20 amino acids of a subject antibody variable domain which may 

be modified without diminishing the native affinity of the 
domain for antigen while reducing its immunogenicity with 
respect to a heterologous species. As used herein, the 
term "subject antibody variable domain" refers to the 

25 antibody upon which determinations are made. The method 
includes the following steps: determining the amino acid 
sequence of a subject light chain and a subject heavy chain 
of a subject antibody variable domain to be modified; 
aligning by homology the subject light and heavy chains 

30 with a plurality of human light and heavy chain amino acid 
sequences; identifying .the amino acids in the subject light 
and heavy chain sequences which are least likely to 
diminish the native affinity of the subject variable domain 
for antigen while, at the same time, reducing its 

35 immunogenicity by selecting each amino acid which is not in 

an interface region of the subject antibody variable domain 
and which is not in a complementarity-determining region or 
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in an antigen-binding region of the subject antibody 

variable domain, but which amino acid is in a position 

exposed to a solvent containing the antibody; changing each 

residue identified above which aligns with a highly or a 
5 moderately conserved residue in the plurality of hiunan 

light and heavy chain amino acid sequences if said 

identified amino acid is different from the amino acid in 

the plurality. 

/mother group of sequences, such as those in 
10 Figures lA and IB may be used to determine an alignment 

from which the skilled artisan may determine appropriate 

changes to make. 

The present invention provides a further method 

wherein the plurality of human light and heavy chain amino 
15 acid sequences is selected from the human consensus 

sequences in Figures lOA and lOB. 

In general, human engineering according to the 

above methods may be used to treat various diseases against 

which monoclonal antibodies generally may be effective. 
20 However, humanized antibodies possess the additional 

advantage of reducing the immunogenic response in the 

treated patient • 

Additional aspects and applications of the 

present invention will become apparent to the skilled 
25 artisan upon consideration of the detailed description of 

the invention which follows. 

BRIEF DE8CRIPTIOK OF THE DRAWINGS 
FIG. 1 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (RTA) (SEQ ID NO: 
30 1) with the amino acid sequence of the Type I ribosome- 

inactivating protein gelonin (SEQ ID NO: 2), wherein 
starred positions indicate amino acids invariant among the 
ricin A-chain and the Type I RIPs; 

FIG. 2 is a computer-generated alignment of the 
35 amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
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inactivating protein BRIP (SEQ ID NO: 3), wherein starred 
positions indicate amino acids invariant among the ricin A- 
chain and the Type I RiPs; 

FIG. 3 is a computer-generated alignment of the 
5 amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
inactivating protein momordin 11 (MOMOII) (SEQ ID NO: 4)^ 
wherein starred positions indicate amino acids invariant 
among the ricin A-chain and the Type I RiPs; 

10 ' FIG. 4 is a computer-generated alignment of the 

amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
inactivating protein luff in (SEQ ID NO* 5), wherein starred 
positions indicate amino acids invariant among the ricin A- 
. 15 chain and the Type I RIPs; 

FIG* 5 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid secjuence of the Type i ribosome- 
inactivating protein atrichosanthin (TRICHO) (SEQ ID NO: 

20 6) , wherein starred positions indicate amino acids 

invariant among the ricin A-chain and the Type I RIPs; 

FIG. 6 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
vith the amino acid sequence of the Type I ribosome- 

25 inactivating protein momordin I (wOMOl) (SEQ ID NO: 7) , 

wherein starred positions indicate amino acids invariant 
among the ricin A-chain and the Type I RIPs; 

FIG. 7 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 

30 with the amino acid sequence of the Type I ribosome- 
inactivating protein Mirabilxs anti-viral protein (M7^) 
(SEQ ID NO: 8), wherein starred positions indicate amino 
acids invariant among the ricin A-chain and the Type I 
RIPs; 

FIG. 8 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
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inactivating protein pokeweed antiviral protein from seeds 
(PAPS) (SEQ ID NO: 9), wherein starred positions indicate 
amino acids invariant among the ricin A-^chain and the Type 
I RIPs; 

5 FIG. 9 is a computer-generated alignment of the 

amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
inactivating protein saporin 6 (SAP6) (SEQ ID NO: 10), 
^ wherein starred positions indicate amino acids invariant 

10 among the ricin A-chain and the Type I RIPs; 

FIGS. lOA and lOB are alignments of the consensus 
amino acid sequences for the subgroups'*!. of light [hKl (htiman 
kappa light chain subgroup 1) , XH3 (hxman kappa light chain 
subgroup 3), hK2 (human kappa light chain subgroup 2), hLl 

15 (human lambda light chain subgroup l) , hL2 (himan lambda 

light chain subgoup 2) , hL3 (human lambda light chain 
subgroup 3), hL6 (htiman lambda light chain subgroup 6), hK4 
(human kappa light chain subgroup 4) , hL4 (human lambda 
light chain subgroup 4) and hL5 (human lambda light chain 

20 subgroup 5] and heavy chains [hH3 (human heavy chain 

subgroup 3), hHl (human heavy chain subgroup 1) and hH2 
(human heavy chain subgroup 2)], respectively, of human 
antibody variable domains; 

FIG. 11 sets out the nucleotide sequences of the 

25 oligonucleotides utilized in the construction of the genes 

encoding modified V/J-regions of the light and heavy chains 
of the H65 mouse monoclonal antibody variable domain; and 
FIGs. 12A and 12B are alignments of human light: 
chain consensus hKl and heavy chain consensus hHl with the 

30 light and heavy chain sequences, respectively, of the 

variable domain of human antibody EU, hximan antibody TAG, 
hiiman antibody TAG modified according to the present 
invention (prop) and human antibody TAG modified according 
to a different method (Que) . 
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DETAILED DESCRIPTION 

Nucleotide sequences of genes encoding three 
plant Type I RIPs and expression vectors containing the 
genes are provided by the present invention. A first plant 
RIP, gelonin, is produced by seeds of Geloniujn multiflorum, 
a plant of the Euphorbiaceae family native to the tropical 
forests of eastern Asia, while a second plant RIP, BRIP, is 
synthesized by the common cereal grain barley. Momordin 
II, a third plant RIP, is produced in Momordica balsamina 
seeds. Analogs of BRIP are also provided by the present 
invention. The analogs were genetically engineered to 
include a cysteine free to participate in a intermolecular 
disulfide bond and were conjugated to antibody molecules 
without non-specific chemical derivatization of the RIP 
15 with crosslinking agents. 

Type I RIP analogs of the present invention offer 
distinct advantages over the natural proteins for use as 
components of immunotoxins. Chemical treatment to 

introduce free sulfhydryl groups in the natural proteins 
20 lacking free cysteines typically involves the non-selective 

modification of amino acid side chains. This non- 
selectivity often results in antibodies conjugated to 
different sites on different RIP molecules (i.e., a 
heterogeneous population of conjugates) and also in a 
25 decrease in RIP activity if antibodies are conjugated in or 

near important regions of the RIP (e.g., the active site or 
regions involved in translocation across cell membranes) • 
In contrast, RIP analogs according to the present invention 
may be conjugated to a single antibody through a disulfide 
bond to a specific residue of the analog resulting in 
reduced batch to batch variation of the immunoconjugates 
and, in some cases, immunoconjugates with enhanced 
properties (e.gr., greater cytotoxicity or solubility). 

Type I plant RIPs, as well as bacterial RIPs such 
35 as Shiga and shiga-like toxin A-chains, are homologous to 

the ricin A-chain and are useful in the present invention. 
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Type I RIPs may be defined and sites for 
substitution of a cysteine in a RIP may be identified by 
— compssring" th"e~pT±in3rry-aTBrno-ac the-RIP to the 

natural ricin A-chain amino acid sequence, the tertiary 
5 structure of which has been described in Katzin et al.. 

Proteins^ 10:251-259 (1991), which is incorporated by 
reference herein. 

TVmino acid sequence alignment defines Type I RIPs 
' in that the ricin A-chain and the Type I plant RIPs have 

10 nine invariant amino acids in common • Based on the ricin 
sequence the invariant amino acids are tyrosineji, 
argininezs, tyrosineeof tyrosine^ia, leucinej4«# glutamic 
acidi77, alaninei7B, arginineiBor and tryptophanzn . The ricin 
A-chain may be used as a model for the three-dimensional 

15 structure of Type I RIPs. A protein lacking a cysteine 

available for conjugation while having ribosome- 
inactivating activity and having the nine invariant amino 
acids when its primary sequence is compared to the primary 
sequence of the ricin A-chain [according to the alignment 

20 algorithm of Myers et al., CAB JOS COMMUNICATIONS, 4 (2) Hi- 

ll (1988), implemented by the PC/GENE program PALIGN 
(Intelligenetics, Inc», Mountain View, California) and 
utilizing the Dayhoff Mutation Data Matrix (MDM-78) as 
described in Schwartz et al., pp. 353-358 in Atlas of 

25 Protein Sequence and Structure, 5 Supp. 3, National 

Biomedical Research Foundation, Washington, D.C. (1978)] is 
defined as a Type I RIP herein and is expected to be useful 
in the present invention. "Corresponding" refers herein to 
amino acid positions which align when two amino acid 

30 sequences are compared by the strategy of Myers et ai., 

supra. 

The primary amino acid sequences of the Type I 
RIPs:gelonin, BRIP, momordin II, luff in [see Islam et al.. 
Agricultural Biological Chem., 54 (^5; : 1343-1345 (199)], 
35 atrichosanthin [see Chow et al., J. Biol. Chem., 255:8670- 

8674 (1990)), momordin I [see Ho et al., BBA, 1088:311-314 
(1991)1, Mirabilis anti-viral protein [see Kabuka et al.. 
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J. Biol. Chem., 264(12)16629-6637 (1989)], pokeweed 
antiviral protein isolated from seeds [see Kung et al,, 
Agric, Biol. Chem., 54 f22; :3301-3318 (1990)) and saporin 
[see Benatti et al., Eur. j. Biochem., 163:465-470 (1989)] 
are individually aligned with the primary sequence of the 
ricin A-chain [see Hailing et al.. Nucleic Acids Res., 
23:8019-8033 (1985)] in FIGs 1-9, respectively, according 
to the. algorithm of Myers et al., supra, as specified 
above . 

FIGS. 1-9 may be utilized to predict the amino 
acid positions of the Type I RiPs where cysteine residues 
may be substituted. Preferred amino acids for cysteine 
substitution are on the surface of the molecule and include 
any solvent accessible amino acids which will not interfere 
with proper folding of the protein if replaced with a 
cysteine. A region of the ricin A-chain comprising such 
amino acids is the carboxyl terminal region. Amino acids 
that should be avoided for replacement are those critical 
for proper protein folding, such as proline, and those that 
are solvent inaccessible. Also to be avoided are the nine 
amino acids invariant among RiPs, and the amino acids in or 
near regions comprising the active site of ricin A-chain as 
depicted in Figure 6 of Katzin et al., supra. 

Therefore, a preferred region of substitution for 
Type I RIPS is their carboxyl terminal region which is 
solvent accessible and corresponds to the carboxyl terminal 
region where Type II rip A-chains and B-chains are 
naturally linked by a disulfide bond. As shown in the 
examples, a cysteine may be substituted in positions in the 
amino acid sequence of a Type I RIP from the position 
corresponding to position 251 in SEQ id no: i to the 
carboxyl terminal position of said Type I rip, resulting in 
RIP analogs which retain enzymatic activity and gain 
disulfide cross- linking capability, one preferred cysteine 
35. substitution position is near the position which 
corresponds to the cysteine at position 259 in the ricin A- 
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For purposes of the present invention, 
immunotoxins comprise a class of compounds of which toxin- 

— ^a-ntitoody^^f usions ^a-nd — inonunoeon j-ugates are — ex-amp 1 es . 

Immunotoxins are particularly suited for use in treatment 
5 of human autoimmune diseases and in the treatment of 
diseases in which depletion of a particular cell type is a 
goal, such as cancer. For example, treatment of autoimmune 
diseases with immunotoxins is described in International 
- Publication No. WO89/06968 published August 10, 1989, which 

10 is incorporated by reference herein. 

In any treatment regimen, the immunotoxins may be 
administered to a patient either sin'^ly or in a cocktail 
containing two or more immunotoxins, other therapeutic 
agents, compositions, or the like, including, but not 

15 limited to, immunosuppressive agents, tolerance-inducing 

agents, potentiators and side-effect relieving agents. 
Particularly preferred are immunosuppressive agents useful 
in suppressing allergic reactions of a host. Preferred 
immunosuppressive agents include prednisone, prednisolone, 

20 DECADRON (Merck, Sharp & Dohme, West Point, Pennsylvania), 

cyclophosphamide, cyclosporine, 6-mercaptopurine, 
methotrexate, azathioprine and i.v. gamma globulin or their 
combination. Preferred potentiators include monensin, 
ammonium chloride, perhexiline, verapamil, amantadine and 

25 chloroquine. All of these agents are administered in 

generally-accepted efficacious dose ranges such as those 
disclosed in the Physician^s Desk Reference, 41st Ed., 
Publisher Edward R. Barnhart, New Jersey (1987) . Patent 
Cooperation Treaty (PCT) patent application WO 89/069767 

30 published on August 10, 1989, discloses administration of 
an immunotoxin as an immunosuppressive agent and is 
incorporated by reference herein. 

Immunotoxins of the present invention may be 
formulated into either an injectable or topical 

35 preparation. Parenteral formulations are known and are 
suitable for use in the invention, preferably for 
intramuscular or intravenous administration. The 
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formulations containing therapeutically-ef f ective amounts 
__.?lJ^^^o?^i^_ are either sterile l iquid soTj>rH»»o n^^^ 
suspensions, or lyophilized versions, and " optionally 
contain stabilizers or excipients. Lyophilized 
compositions are reconstituted with suitable diluents, 
e.sr., water for injection, saline, o.3% glycine and the 
like, at a level of about from 0.01 mg/kg of host body 
weight to 10 mg/kg where the biological activity is less 
than or equal to 20 ng/ml when measured in a reticulocyte 
lysate assay. Typically, the pharmaceutical compositions 
containing immunotoxins of the present invention are 
administered in a therapeutically effective dose in a range 
of from about 0.01 mg/kg to about 5 mg/kg of the patient. 
A preferred, therapeutically effective dose of the 
pharmaceutical composition containing immunotoxins of the 
invention is in a range of from about o.oi mg/kg to about 
0.5 mg/kg body weight of the patient administered over 
several days to two weeks by daily intravenous infusion, 
each given over a one hour period, in a sequential patient 
20 dose-escalation regimen. 

Immunotoxin compositions according to the 
invention may. be formulated into topical preparations for 
local therapy by including a therapeutically effective 
concentration of immunotoxin in a dermatological vehicle. 

25 The amount of immunotoxin to be administered, and the 
immunotoxin concentration in the topical formulations, 
depend upon the vehicle selected, the clinical condition of 
the patient, the systemic toxicity and the stability of the 
immunotoxin in the formulation. Thus, a physician knows to 

30 employ the appropriate preparation containing the 
appropriate concentration of immunotoxin in the 
formulation, as well as the appropriate amount of 
formulation to administer depending upon clinical 
experience with the patient in question or with similar 

35 patients. The concentration of immunotoxin for topical 
formulations is in the range of greater than from about 0.1 
mg/ml to about 25 mg/ml. Typically, the concentration of 
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immunotoxin for topical formulations is in the range of 
greater than from about 1 mg/ml to about 20 mg/ml. Solid 
dispersions of immunotoxins according to the invention, as 
well as solubilized preparations, may be used. Thus, the 
5 precise concentration to be used in the vehicle is subject 

to modest experimental manipulation in order to optimize 
the therapeutic response. For example, greater than about 
10 mg immunotoxin/ 100 grams of vehicle may be useful with 
1% w/w hydrogel vehicles in the treatment of skin 

10 inflammation • Suitable vehicles, in addition to gels, are 

oil-in-water or water-in-oil emulsions using mineral oils, 
petroleum and the like. ^ 

Immunotoxins according to the invention may be 
optionally administered topically by the use of a 

15 transdermal therapeutic system [Barry, Dermatological 

Formulations, p. 181 (1983) and literature cited therein] • 
While such topical delivery systems may be designed for 
transdermal administration of low molecular weight drugs, 
they are capable of percutaneous delivery. Further, such 

20 systems may be readily adapted to administration of 

immunotoxin or derivatives thereof . and associated 
therapeutic proteins by appropriate selection of the rate- 
controlling microporous membrane. 

Topical preparations of immunotoxin either for 

25 systemic or local delivery may be employed and may contain 
excipients as described above for parenteral administration 
and other excipients used in a topical preparation such as 
cosolvents, surfactants, oils, hmnectants, emollients, 
preservatives, stabilizers and antioxidants. 

30 Pharmacologically-acceptable buffers may be used, e.g., 
Tris or phosphate buffers. The topical formulations may 
also optionally include one or more agents variously termed 
enhancers, surfactants, accelerants, adsorption promoters 
or penetration enhancers, such as an agent for enhancing 

35 percutaneous penetration of the immunotoxin or other 

agents. Such agents should desirably possess some or all 
of the following features as would be known to the 
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10 



15 



ordinarily skilled artisan: pharmacological inertness, 
_non-promotivej3f^ 

with immunotoxin (non-inactivating), and capable of 
formulation into creams, gels or other topical delivery 
systems as desired. 

Immunotoxins according to the present invention 
may also be administered by aerosol to achieve localized 
delivery to the lungs. This is accomplished by preparing 
. an aqueous aerosol, liposomal preparation or solid 
= particles containing immunotoxin. Ordinarily, an aqueous 
aerosol is made by formulating an aqueous solution or 
suspension of immunotoxin together with conventional 
pharmaceutically acceptable carriers and stabilizers. The 
carriers and stabilizers vary depending upon the 
requirements for the particular immunotoxin, but typically 
include: nonionic surfactants (Tweens, Pluronics, or 
polyethylene glycol); innocuous proteins like serum 
albumin, sorbitan esters, oleic acid, lecithin; amino acids 
such as glycine; and buffers, salts, sugars or sugar 
20 alcohols. The formulations may also include mucolytic 
agents as well as bronchodilating agents. The formulations 
are sterile. Aerosols generally are prepared, from- isotonic 
solutions. The particles optionally include normal lung 
surfactants. 

Alternatively, immunotoxins of the invention may 
be administered orally by delivery systems such as 
proteinoid encapsulation as described by Steiner, et al., 
U.S. Patent No. 4,925,673, incorporated by reference 
herein. Typically, a therapeutical ly-effective oral dose 
30 of an immunotoxin according to the invention is in the 
range from about 0.05 mg/kg body weight to about 50 mg/kg 
body weight per day. A preferred effective dose is in the 
range from about 0.05 mg/kg body weight to about 5 mg/kg 
body weight per day. 

Immunotoxins according to the present invention 
may be administered systemically, rather than topically, by 
injection intramuscularly, subcutaneously, intrathecally or 
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intraperitoneally or into vascular spaces, particularly 
into the joints, e.g., intraarticular injection at a dosage 
of* greater than about 1 tig/cc joint fluid/day. The dose 
will be dependent upon the properties of the specific 
5 inonunotoxin employed, e.g., its activity and biological 

half-life, the concentration of inmiunotoxin in the 
formulation, the site and rate of dosage, the clinical 
tolerance of the patient involved, the disease afflicting 
' the patient and the liXe, as is well within the skill of 

10 the physician* 

The immunotoxins of the present invention may be 
administered in solution. The pH of tfte solution should be 
in the range of pH 5 to 9-5, preferably pH 6.5 to 7.5. The 
immunotoxin or derivatives thereof should be in a solution 

15 having a suitable pharmaceutically-acceptable buffer such 

as phosphate, Tris (hydroxymethyl) aminomethane-HCl or 
citrate and the like. Buffer concentrations should be in 
the range of 1 to 100 mM. The immunotoxin solution may 
also contain a salt, such as sodi\im chloride or potassium 

20 chloride in a concentration of 50 to 150 mM. An effective 

amount . of a stabilizing agent such as an albumin, a 
globulin, a gelatin, a protamine or a salt of protamine may 
also be included, and may be added to a solution containing 
immunotoxin or to the composition from which the solution 

25 is prepared. 

Systemic administration of immunotoxin may be 
made daily and is generally by intramuscular injection, 
although intravascular infusion is acceptable. 
Administration may also be intranasal or by other 

30 nonparenteral routes. Immunotoxins of the present 

invention may also be administered via microspheres, 
liposomes or other microparticulate delivery systems placed 
in certain tissues including blood. Topical preparations 
are applied daily directly to the skin or mucosa and are 

35 then preferably occluded, i.e., protected by overlaying a 

bandage, polyolefin film or other barrier impermeable to 
the topical preparation. 
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The following Examples are illustrative of 
practice of the invention but are not to be construed as 
limiting the invention. The present application is broadly 
organized as follows. The first portion of the application 
5 broadly teaches the preparation, expression and properties 

of an exemplary RIP, gelonin. A second portion of the 
application teaches the preparation of human-engineered 
antibodies. A third portion of the application teaches the 
construction and properties of immunoconjugates, comprising 
an RIP and an antibody or fragment thereof comprising an 
antigen-binding portion. a forth portion of the 

application relates to the preparation and properties of 
immunofusion proteins comprising an RIP and an antibody or 
fragment thereof comprising an antigen-binding portion. A 
fifth portion of the application teaches the preparation 
and properties of the rip Barley ribosome-inactivating 
protein and a final aspect of the invention provides the 
preparation and properties of the rip momordin. 

Specifically, Example i relates to the 
preparation of the RIP gelonin. Construction of expression 
vector, comprising the gelonin gene, including expression 
and purification of gelonin, is taught in Example 2. The 
assembly of gelonin genes with cysteine residues available 
for conjugation is taught in Example 3 and Example 4 
provides results of a reticulocyte lysate assay performed 
on gelonin. 

Example 5 teaches the construction of human- 
engineered antibodies for use in immunotoxins of the 
invention and Example 6 demonstrates transfection of he3 
genes, expression of those genes, and purification of the 
products thereof. 

Example 7 next teaches the preparation of gelonin 
immunoconjugates. The procedures and results of whole cell 
kill assays are next presented in Example 8. Various 
properties of gelonin immunoconjugates are taught in 
Example 9 and Examples 10 and ii teach the pharmacokinetics 
of two types of immunoconjugates. Examples 12 and 13 teach 
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the immuTiogenicity of immunoconjugates of the invention and 
the In vivo efficacy of those iTrununoconjugates, 

— rBrsp^ctively-;^ 

The construction of genes encoding gelonin 
5 iininunof usions is taught in Examples 14, 15, 16, 17 and 18. 

Example 19 teaches alternative cathepsin cleavable linkers 
for use in the immunofusions of the invention. The 
expression and purification of various genes encoding 
immunoconjugates are presented in Example 20 and their 
10 activity properties are presented in Example 21. 

The construction of genes encoding the RIP, BRIP^ 
and its expression and properties are- taught in Examples 
22, 23, and 24. 

Finally, construction of genes encoding momordin 
15 and properties of momordin on expression are taught in 

Example 25. 

Example l 

Preparation Of Gelonin 

The cloning of the gelonin gene according to the 

20 present invention obviates the requirement of purifying the 

RIP gene product from its relatively scarce natural source, 
G. multifloirvm seeds. Cloning also allows development of 
gelonin analogs which may be conjugated to antibodies 
without prior chemical derivatization and also allows 

25 development of gelonin gene fusion products. 

A. Preparation Of RNA From G. Multiflorum Seeds 

Total RNA was prepared from Gelonium seeds (Dr. 
Michael Rosenblum, M.D. Anderson Cancer Center, Houston, 
Texas) by a modification of the procedure for preparation 

30 of plant RNA described in Ausubel et al., eds,. Current 

Protocols in Molecular Biology, Wiley & Sons-, 1989. 
Briefly, 4.0 grams of seeds were ground to a fine powder in 
a pre-cooled (-70 *C) mortar and pestle with liquid Nj. The 
powder was added to 25 ml Grinding buffer (0.18M Tris, 

35 0.09H LiCl, 4.5mW EDTA, 1% SDS, pK 8.2) along with 8.5 ml . 
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of phenol equilibrated with TLE (0.2M Tris, O.IM LiCl, 5mM 
EDTA pH8.2). The mixture was homogenized using a Polytron 
PT-1035 (#5 setting). 8.5 ml of chloroform was added, 
mixed and incubated at 50 for 20 minutes. The mixture 
5 was centrifuged at 3000 g for 2 0 minutes in a rotor 
precooled to 4*C and the aqueous phase was transferred to 
a new tube. 8.5 ml of phenol was added followed by 8.5 ml 
of chloroform and the mixture was recentrifuged. This 
extraction was repeated 3 times. The RNA in the aqueous 

10 1 phase was then precipitated by adding 1/3 volvime 8M LiCl, 
and incubated at 4*C for 16 hours. Next, the RNA was 
pelleted by centrifugation for 20 minutes at 4*C. The 
pellet was washed with 5 ml of 2M LiCX, recentrifuged and 
resuspended in 2 ml of water. The RNA was precipitated by 

15 addition of NaOAc to 0.3M and 2 volumes of ethanol^ The 

3WA was stored in 7 0% ethanol at -70 'C. 

B* cDNA Preparation 

cDNA was prepared from total Gelonium RNA by two 
methods. The first method involved making a cDNA library 
in the bacterial expression plasmid pcDNAII using the 
Librarian II cDNA Library Construction System kit 
(Invitrogen) . Approximately 5 of total RNA was 

converted to first strand cDNA with a l:i mixture of random 
primers and oligo-dT. Second strand synthesis with DNA 
25 polymerase I was performed as described by the system 

manufacturer. Double stranded cDNA was ligated to BstXl 
linkers and size fractionated. Pieces larger than about 
500 bp were ligated into the expression vector provided in 
the kit. Individual vectors were introduced into E. coli 
30 either by transformation into high-efficiency competent 
cells or by electroporation into electrocompetent cells. 
Electroporation was performed with a BTXIOO unit (BTX, San 
Diego, CA) in 0.56/i Flatpack cells as recommended by BTX 
based on the method of Dower et al.. Nucleic Acids Res., 
35 15:6127-6145 (1988), at a voltage amplitude of 850 V and a 
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pulse length of 5 mS. The resulting library consisted of 
approximately 150,000 colonies* 

Th^e_-S.acjDnd^inethQd__inv.oJLv_ed_-generating cDNA using 

the RNA-PCR kit sold by Perkin-Elmer-Cetus. About 100 ng 
5 of total Gelonium RNA was used as template for cDNA 

synthesis. 

C, Determination Of The Gelonin Protein Sequence 

The partial secjuence of the native gelonin 
protein was determined by direct amino acid sequence 
10 analysis using automated Edman degradation as recommended 

by the manufacturer using an Applied Biosystems model 470A 
protein sequencer. Proteolytic peptide fragments of 
gelonin (isolated from the same batch of seeds as the total 
RNA) were sequenced. 

15 D. cloning Of The Gelonin Gene 

Three overlapping gelonin cDNA fragments were 
cloned and a composite gelonin gene was assembled from the 
three fragments. 

1. Cloning Of The Fragment Encoding The Middle 
20 Amino Acids Of Gelonin In Vector PING3823 

Degenerate DNA primers based on the gelonin 

partial amino acid sequences were used to PCR-amplify 

segments of the cDNA generated with Perkin^Elmer-Cetus kit. 

Six primers were designed based on regions of the gelonin 

25 amino acid sequence where degeneracy of the primers could 

be minimized. Appropriate pairs of primers were tested for 
amplification of a gelonin gene fragment. Products of the 
expected DNA size were identified as ethidium bromide- 
stained DNA bands on agarose gels that DNA was treated with 

30 T4 DNA polymerase and then purified from an agarose gel. 

Only the primer pair consisting cf primers designated gelo- 
7 and gelo-5 yielded a relatively pure product of the 
expected size. The sequences of degenerate primers gelo-7 
and gelo-5 are set out below using lUPAC nucleotide 

35 symbols. 
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Gelo-7 (SEQ ID NO: 14) 

5' TTYAARGAYGCNCCNGAYGCNGCNTAYGARGG 3' 

Gelo-5 (SEQ ID NO: 15) 

31 TTYTTYATRATRCANTGNCGNCANCTRGTYCA 5« 
5 Primer gelo-7 corresponds to amino acids 87-97 of gelonin 
while primer gelo-5 corresponds to amino acids 226-236. 
The blunt-ended DNA fragment (corresponding to amino acids 
87 to 236 of gelonin) generated with primers gelo-7 and 
gelo-5 was cloned into pucis (BRL, Gaithersburg, Maryland) i 
10 . The DNA sequence of the insert was determined, and the 
deduced amino acid sequence based on the resulting DNA 
sequence matched the experimentally determined gelonin 
amino acid sequence. The clone containing this gelonin 
segment was denoted pING3726. 

The insert of clone pING3726 was labeled with "P 
and used as a probe to screen the 150, OOO-member Gelonium 
cDNA library. Only one clone hybridized to the library 
plated in duplicate. This clone was purified from the 
library and its DNA sequence was determined. The clone 
contains a fragment encoding 185 of the 270 amino acids of 
gelonin (residues 25-209) and is denoted pING3823. 

2. Cloning Of The Fragment Encoding 

The N-Termi nal Amjno Acids Of Gelonin 

Based on the sequence determined for the gelonin 
25 gene segment in pING3726, exact oiigonucleotide primers 
were designed as PCR amplification primers to be used in 
conjunction with a degenerate primer to amplify a 5' 
gelonin gene fragment and with a nonspecific primer to 
amplify a 3' gelonin gene fragment. cDNA generated using 
30 the PerKin-Elmer-Cetus RNA-PCR kit was amplified. 

To amplify the 5 '-end of the gelonin gene, PCR 
amplification with a degenerate primer gelo-l and an exact 
primer gelo-10 was performed. The sequences of the primers 
are set out below. 
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Gelo-1 (SEQ ID NO: 16) 

5' GGNYTNGAYACNGTNWSNTTYWSNACNAARGG 3« 
GGlo-lO (SEQ ID NO: 17) 
3 1 TGTCTGAACCCGTAACTTGGTAA 5* 
5 Primer gelo-1 corresponds to amino acids 1-11 of the 

gelonin gene while primer gelo-10 corresponds to amino 
acids 126-13 3. The product from the reaction was re- 
amplified with gelo-1 (SEQ ID NO: 16) and gelo-11 (an exact 
, primer comprising sequences encoding amino acids 119-125 of 
10 gelonin) to confer specificity to the reaction product. 

The sequence of primer gelo~ll is listed below. 

Gelo-11 (SEQ ID NO: IS)-*. 
3» CACTCTTCCGTATATCTCTCTGT 5» 
Hybridization with an internal probe confirmed that the 
15 desired specific gelonin DNA fragment was amplified. That 

fragment was cloned into pUC18 and the vector generated was 
designated pING3727. The fragment was sequenced, revealing 
that the region of the fragment (the first 27 nucleotides) 
corresponding to part of the degenerate primer gelo-1 could 
20 not be translated to yield the amino acid sequence upon 

which primer gelo-1 was originally based. This was not 
unexpected considering the degeneracy of the primer. The 
fragment was reamplified from the Gelonium cDNA with exact 
primers geIo-11 (SEQ ID NO: 18) and gelo-5» (which extends 
25 upstream of the 5' end of the gelonin gene in addition to 
encoding the first 16 amino acids of gelonin) . The 
sequence of primer gelo-5* is set out below. 

Gelo-5' (SEQ ID NO: 19) 
5* TCAACCCGGGCTAGATACCGTGTCAT 
30 TCTCAACCAAAGGTGCCACTTATATTA 3 ». 

The resulting DNA fragment encodes the first 125 amino 
acids of gelonin- While the majority of the sequence is 
identical to the natural gelonin gene, the first 32 
nucleotides of the DNA fragment may be different. For the 
35 purposes of this application this N-terminal fragment is 
referred to as fragment GELl-125. 
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3. Cloning Of The Fragment Encoding 

The C-T erminal Amino Acids of Gelonin 

To-ainp-]rirf-y-the-^--^^xJ-o^the-ge-lon-in-gene as- well 

as 3' untranslated sequences, PCR amplification with exact 
5 priners gelo-9 and XE-dT was performed. The sequence of 
each of the primers is set out below. 

Gelo-9 (SEQ ID NO: 20) 
5» CTTCATTTTGGCGGCACGTATCC 3« 
XE-dT (SEQ ID NO: 21) 
10 . 3' TTTTTTTTTTTTTTTTTTTTTAG 

GGTGCATTCGAACGTCGGAGCTC 5 • 
Primer gelo-9 corresponds to amino acids 107-113 of 
gelonin. Primer XE-dT consists of a* 3 • oligo-dT portion 
and a 5' portion containing the restriction sites Hindlll 
15 and Xhol, and will prime any poly A-containing cDNA. The 
reaction product was reamplified with exact primers gelo-8 
and XE. The sequences of primers gelo-8 and XE are set out 
below. 

Gelo-8 (SEQ ID NO: 22) 
20 5» CTCGCTGGAAGGTGAGAA 3' 

XE (SEQ ID NO: 23) 
3' AGGGTGCATTCGAACGTCGGAGCTC 5» 
Primer gelo-8 consists of sequences encoding amino acids 
115-120 of gelonin while the primer XE corresponds to the 
25 5' portion of the XE-dT primer which contains Hindlll and 
Xhol restriction sites. Hybridization with internal probes 
confirmed that the desired gelonin gene fragment was 
amplified. That fragment was then cloned into puciB by two 
different methods. First, it was cloned as a blunt-ended 
30 fragment into the Smal site of pt7Cl8 (the resulting vector 

was designated pING3728) and, second, it was cloned as an 
EcoRI to Hindlll fragment into pUC18 (this vector was 
designated pING3729) . Both vector inserts were sequenced. 
The insert of pING3728 encodes amino acids 114-270 of 
35 gelonin, while the insert of plNG3729 encodes amino acids 

184-270 of gelonin plus other 3' sequences. 
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4 . Assembly Of The Overlapping Gelonin DNA 
Fragments Into A Composite Gelonin Gene 

To reassemble the C-terminal two-thirds of the 

gelonin gene, vector pING3729 was cut with Sspl (one Sspl 

5 site is located within the vector and the second is located 

about 80 bp downstream of the termination codon of the 

insert in the vector) and an Xhol linker (8 bp. New England 

Biolabs) was ligated to the resulting free ends. The DNA 

was then cut with Xhol and EcoRl, and the 350 bp fragment 

10 generated, encoding amino acids 185-270 of gelonin, was 

isolated. This 350 bp fragment was ligated adjacent to a 
j^col to ^coRI fragment from pING3823 encoding amino acids 
37-185 of gelonin in a intermediate vector denoted 
pING3730, thus reassembling the terminal 87% of the gelonin 

15 gene (amino acids 37-270) . 

Next, fragment GELl-125 was cut with Smal and 
Ncol, resulting in a fragment encoding amino acids 1-36 of 
gelonin which was ligated along with the NcoJ to Xhol 
fragment of pING373 0 into the vector pIClOO. [pIClOO is 

20 identical to pING1500 described in Better, et al.. Science, 

240:1041-1043 (1988), incorporated by reference herein], 
except that it lacks 37 bp upstream of the pelB leader 
sequence. The 37 bp were eliminated by digestion of 
pING1500 with SphJ and JBcoRI, treatment with T4 polymerase, 

25 and religation of the vector. This manipulation 

regenerated an EcoRI site in the vector while eliminating 
other undesirable restriction sites.] Before ligation, the 
vector pIClOO had previously been digested with SstI, 
treated with T4 polymerase, and cut with Xhol. The 

30 ligation generated a new vector containing a complete 
gelonin gene which was designated plasmid pING3731 and 
deposited with The American Type Culture Collection, 12301 
Parklawn Drive, Rockville, Maryland 20852 on October 2, 
1981 as Accession No.. 68721. The complete DNA sequence of 

35 the gelonin gene is set out in SEQ ID NO: 11. 
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ExMple 2 

A. Construction Of Expression 

Vectors Containin g The Gelonin Gf>ni» 

A first E, coli expression vector was constructed 
5 containing the gelonin gene linked to the Erwinia 
carotovora pelB leader sequence, and to the Salmonella 
typhxjauriujb araB promoter. A basic vector containing the 
araB promoter is described in co-owned U.S. Patent NO. 
5,028,530 issued July 2, 1991 which is incorporated by 
10 ^ reference herein. The vector containing the araB promoter 
was cut with EcoRl and ^hol. Two DNA fragments were then 
ligated in tandem immediately downstream of the promoter. 
The fragment ligated adjacent to the f>romoter was a 131 bp 
fragment derived from 5stl digestion, T4 polymerase 
15 treatment and digestion with EcoHl of the pIClOO vector 
which includes the leader sequence of the E. carotovora 
pelB gene. The translated leader sequence is a signal for 
secretion of the respective protein through the cytoplasmic 
membrane. The fragment ligated downstream of the leader 
sequence was a Smal to Xhol fragment from plNG3731 which 
contains the complete gelonin gene. Thus, the expression 
vector contains the gelonin gene linked to the pelB leader 
sequence and the araB promoter. This plasmid is designated 
PING3733. 

25 A second expression vector may be constructed 

that is identical to the first except that the gelonin gene 
sequences encoding the nineteen C-terminal amino acids of 
gelonin are not included. The cDNA sequence of the gelonin 
gene predicted a 19 residue C-terminal segment that was not 

30 detected in any peptide fragments generated for 
determination of the gelonin amino acid sequence. These 19 
amino acids may represent a peptide segment that is cleaved 
from the mature toxin post-translationally, i.e. that is 
not present in the native protein. A similar C-terminal 

35 amino acid segment was identified in the plant toxin 

Q-trichosanthin [Chow et al., j, bxoI. Cbeia., 265:8670-8674 
(1990)]. Therefore, the expression product without the C- 
terminal fragment is of interest. 
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For construction of a gelonin expression vector 
without the 19 C-terminal amino acids of gelonin,. PCR was 
used to amplify-and-alter the 3 '-end of the gene. pING3728 
was amplified with primers gelo-14 and gelo-9 (SEQ ID NO: 
20) . The sequence of primer gelo-14 is set out below. 

Gelo-14 (SEQ ID NO: 24) 

5» TGATCTCGAGTACTATTTAGGATCTTTATCGACGA 3» 
Primer gelo-14, which corresponds to gelonin amino acids 
245-256, introduces a termination codon (underlined in the 
primer sequence) in the gelonin gene sequence which stops 
transcription of the gene before the sequences encoding the 
terminal 19 amino acids of the gelonin'^'and also introduces 
a Xhol site immediately downstream of the termination 
codon. The PCR product was cut with Xhol and .BcoRI, and 
the resulting 208 bp fragment encoding amino acids 185-251 
of gelonin was purified from an agarose gel. This fragment 
was ligated adjacent to the Ncol to JETcoRI fragment from 
pING3 823 encoding amino acids 37-185 of gelonin to generate 
plasmid pING3732. A final expression vector, pING3734, 
containing a gelonin gene with an altered 3 '-end was 
generated by substituting an Ncol to Xhol fragment encoding 
amino acids 37-251 of gelonin from pING3732 into pING3733. 

B. Identification Of The Native Gelonin 5* -End 

Inverse PCR was used to identify a cDNA clone 
25 encoding the 5 '-end of the mature gelonin gene. 5 /xg of 
total G. multiflorum RNA was converted to cDNA using the 
Superscript Plasmid System (BRL, Gaithersburg, Maryland) 
with Gelo-11 (SEQ ID NO: 18) as a primer. Gelonin cDNA was 
self -ligated to generate covalent circular DNA and the 
30 ligated DNA was amplified by PCR with oligonucleotides 

Gelo-9 (SEQ ID NO: 20) and Gelo-16. The sequence of primer 
Gelo-16 is set out below. 

Gelo-16 (SEQ ID NO: 25) 
5 ' GTAAGCAGCATCTGGAGCATCT 3 • 
35 The PCR product was size-fractionated on an agarose gel and 
DNAs larger than 3 00 bp were cloned into Smal cut pUC18. 
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Several clones were sequenced with the primer Gelo-18, the 
sequence of which is set out below. 

'Gelb-18~ (SEQ"ID'N0: TS) • 

51 CATTCTU^GAAATTCACGTAGG 3» 
5 A clone identified as having the largest gelonin-specif ic 
insert was designated plNG3826. The DNA sequence of 
PING3826 included the first 32 nucleotides of the natural/ 
mature gelonin gene not necessarily present in gelonin 
expression plasmids pING3733 and plNG3734. The complete 
10 . DNA sequence of the natural gelonin gene is set put in SEQ 
ID NO: 11. 

C. Construction Of Expression Vectors 

Containing A Gelonin Gen e with A Natu-ral 51 End 

Derivatives of expression vectors pING3733 and 
PING3734 (described above) containing a gelonin gene with 
the natural 5« sequence were generated as follows. The 5'- 
end of gelonin was amplified from pING3826 with the PCR 
primers Gelo-16 (SEQ ID NO: 24) and Gelo-17, the sequence 
of which is set out below. 
20 Gelo-17 (SEQ ID NO: 27) 

5 • GGCCTGGACACCGTGAGCTTTAG 3 • 
The 285 bp PCR product was treated with T4 polymerase and 
cut with Ncol. The resulting 100 bp 5 '-end DNA fragment 
was isolated from an agarose gel and ligated adjacent to 
25 the 120 bp pelB leader fragment from plClOQ (cut with Sstl, 
treated with T4 polymerase and cut with Pstl) into either 
P1NG3733 or PING3734 digested with Pstl and NcoJ. The 
resulting plasmids pING3824 and plNG3825 contain the entire 
native gelonin gene and the native gelonin gene minus the 
nineteen amino acid carboxyl extension, respectively, 
linked to the pelB leader and under the transcriptional 
control of the arah promoter. The gene construct without 
the nineteen amino acid carboxyl extension in both plNG3734 
and PING3825 encodes a protein product referred to in this 
35 application as "recombinant gelonin". 
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D. Purification Of Recombinant Gelonin 

Recombinant gelonin was purified by the following 
procedure: — E.-'coii-f ermentation broth- was- concentrated and 
buffer-exchanged to 10 mM sodi\m phosphate at pH 7.0 by 
5 using an SIOYIO cartridge over a DCIO unit (Amicon) the 

concentrated and buffer-exchanged material was applied to 
a CM52 column (100 q, 5X10 cm)* The column was washed with 
1 L of starting buffer and eluted with a 0 to 300 mM NaCl 
- gradient in starting buffer (750 ml total volume) . The 

10 pure gelonin containing fractions were pooled (elution was 

from 100-250 mM NaCl) , concentrated over an Amicon YMIO 
membrane^ equilibrated with 10 mM sodiom phosphate buffer, 
pH 7.0, and stored frozen at -20*C. A further purification 
step was attempted using Blue Toyopearl chromatography. 

15 However, this procedure did not result in an increased 

purity of material and resulted in an approximate 50% loss 
of the starting material. 

Example 3 

Assembly Of Gelonin Genes With 
2 0 Cysteine Residues Available For Conjugation 

The wild-type gelonin protein has two cysteine 

residues at positions 44 and 50 which are linked by an 

endogenous disulfide bond. The protein contains no free 

cysteine residue directly available for conjugation to 

25 antibodies or other proteins. Analogs of gelonin which 

contain a free cysteine residue available for conjugation 
were generated by three different approaches. In one 
approach, various residues along the primary sequence of 
the gelonin were replaced with a cysteine residue, creating 

30 a series of analogs which contain an odd number of cysteine 

residues. In another approach, one of the two endogenous 
cysteines was replaced by alanine, creating a molecule 
which lacks an intrachain disulfide bond but contains a 
single, unpaired cysteine. In yet another approach both 

35 endogenous cysteines were replaced by alanines and a third 

non-cysteine residue was replaced by a cysteine, creating 
an analog with a single, unpaired cysteine. 
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Fifteen analogs of gelonin were constructed. Ten 
non-cystein e residues of gelonin were, targeted for 
substitution with a"cysteTne' residue. Comparison "of the 
amino acid sequence of gelonin to the natural amino acid 
5 sequence and tertiary structure of the ricin A-chain (see 
FIG. 1) suggested that these positions would be at the 
surface of the molecule and available . for conjugation. 
Each of the ten gelonin analogs include a cysteine 
substituted in place of one of the following residues: 
10 -- lysineio, asparagine^o, isoleucineioa, aspartic acidm, 
argininei84, serineaw, asparagine^j,, lysine2„, aspartic 
acid2«7, and lysinej^, and the analogs have respectively been 
designated Gelc,o, Gel^o, Gelcjoa, Gelfe„„ Gel„„, Gel„„, 
Gelc23,, Gelc24*, Gelc247, and Gelc2«. 
^5 Two analogs of gelonin were constructed in which 

one of the native gelonin cysteines that participates in an 
endogenous disulfide bond was replaced with a non-cysteine 
. residue. Specifically, the cysteine at position 50 was 
replaced with an alanine residue, creating a gelonin analog 
20 (designated Gelj^jotct*), shown in SEQ ID NO: 99) which has a 

cysteine available for disulfide bonding at position 44. 
The GelAjt),c**) analog has been referred to previously as Gelgu 
(see, e.g., co-owned, co-pending U.S. Patent Application 
Serial No. 07/988,430, incorporated by reference herein). 
25 Conversely, the cysteine at position 44 was replaced with 
an alanine residue, resulting in an analog (designated 
Gel^Mcsof shown in SEQ ID NO: lOO) which has a cysteine 
available for disulfide bonding at position 50. The 
GelAM(C30) analog has been referred to previously as Gelgso 
(see, e.g., co-owned, co-pending U.S. Patent Application 
Serial No. 07/988,430, incorporated by reference herein). 
The combined series of the foregoing twelve analogs thus 
spans the entire length of the mature gelonin protein. 

Another gelonin analog (Gel^t^Aso SEQ ID NO: lOl) 
35 was constructed in which both native gelonin cysteines were 
replaced with alanines. The Gel^AAso analog has been 
referred to previously as Gelc.*,«oA (see, e.g., co-owned, co- 
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pending U.S. Patent Application Serial No. 07/988,430, 
incorporated by reference herein) . Two additional analogs 
were constructed which have alanine residues substituted in 
place of both native cysteines and have either a cysteine 
5 residue substituted in place of the native lysine at 

position 10 (GelcioAAAAso/ shown in SEQ ID NO: 110) or a 
cysteine residue substituted in place of the native 
aspartate at position 247 (Gelc2*7AMA5o# shown in SEQ ID NO: 
- Ill) . 

10 The variants of recombinant gelonin were 

constructed by restriction fragment manipulation or by 
overlap extension PCR with synthetic oligonucleotides* The 
sequences of the primers used for PCR are set out below. 
In each mutagenic primer sequence, the nucleotides 
15 corresponding to the changed amino acid, either a cysteine 

or an alanine residue, are underlined. 

Gelo-9 (SEQ ID NO: 20) 
Gelo-11 (SEQ ID NO: 18) 
Gelo-16 (SEQ ID NO: 25) 
20 Gelo-17 (SEQ ID NO: 27) 

Gelo-18 (SEQ ID NO: 26) 

Gelo-19 (SEQ ID NO: 58) 

5 • CAGCCATGGAATCCCATTGCTG 3 » 

GeloC-1 (SEQ ID NO: 28) 
25 5' TCGATTGCGATCCTAAATAGTACTC 3* 



GeloC-2 (SEQ ID NO: 29) 

51 TTTAGGATCGCAATCGACGAACTTCAAG 3* 

GeloC-3-2 (SEQ ID NO: 30) 

5' GTTCGTCTGTAAAGATCCTAAATAGTACTCGA 3' 
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GeloC-4 (SEQ ID NO: 31) 

5» GGATCTTTACAGACGAACTTCAAGAGT 3* 
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GeloC-5 (SEQ ID NO: 32) 

51 TCTTGTGCTTCGTCGATAAAGATCC 3 » 

GeloC-6 (SEQ ID NO: 33) 

5 • ATCGACGAAGCACAAGAGTGCTATTTT 3 • 

GcloC-9 (SEQ ID NO: 34) 

5» GTAAAACCATGCATAGCACTCTTGAAGTTCGT 3» 

GeloC-10 (SEQ ID NO: 35) 

5* AGTGCTATGCATGGTTTTACTTGATCAACTGC 3» 

GeloC-13 (SEQ ID NO: 36) 

5* AGCACATGTGGTGCCACTTATATTACCTA 3» 

GeloC-14 (SEQ ID NO: 37) 

5* TAAGTGGCACCACATGTGCTAAAGCTCACGGTG 3» 

GeloC-15 (SEQ ID NO: 38) 

5» TGACTGTGGACAGTTGGCGGAAATA 3» 

GeloC-16 (SEQ ID NO: 39) 

5 • GCCAACTGTCCACAGTCATTTGAAAGCGCTACC 3 " 
GeloC-17 (SEQ ID NO: 40) 

5 » GATGATCCTGGAAAGGC2TTCGTTTTGGTAGCGCTT3 » 

GeloC-18 (SEQ ID NO: 41) 
5» AAGCCTTTCCAGGATCATCAGC 
TTTTTTGCGCAGCAATGGG 3» 

GeloC-19 (SEQ ID NO: 42) 

5« AAGCCTTTCCAGGATCATCACAT 3» 

GeloC-2 0 (SEQ ID NO: 59) 

5"» CACATCTAAAACAAGACTTCATTTTGGC 3 • 
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GeloC-21 (SEQ ID NO: 60) 

5 ' TGAAGTCTTGTTTTAGATGTGTTTTTGAAGAGGCCT3 • 

GeloC-22 (SEQ ID NO: 61) 

5» ATGCCATATGCAATTATAAACCAACGGAGA 3' 

5 GeloC-23 (SEQ ID NO: 62) 

5» GGTTTATAATTGCATATGG 
CATTTTCATCAAGTTTCTTG 3* 

GeloC-24 (SEQ ID NO: 63) 

5» CTTTCAACAATGCATTCGCCGGGCGAATAATAC 3» 

10 GcloC-25 (SEQ ID NO: 64) 

5' GCGAATGCATTGTTGAAAGTTATTTCTAATTTG 3» 

GeloC-26 (SEQ ID NO: 65) 

51 GTTTTGTGAGGCAGTTGAATTGGAAC 3» 

GeloC-27 (SEQ ID NO: 66) 
15 5' TTCAACTGCCTCACAAAACATTCCATTTGCACCT 3« 

GeloC--28 (SEQ ID NO: 67) 

5' AAAAGCTGATGATCCTGGJAA6TG 3» 

GeloC-29 (SEQ ID NO: 68) 

5» TCCAGGATCATCAGCTTTTTTGCGCAGCAATGGGA 3* 

20 araB2 (SEQ ID NO: 43) 

5' GCGACTCTCTACTGTTTC 3' 

HrNDIII-2 (SEQ ID NO: 44) 
5.' CGTTAGCAATTTAACTGTGAT 3» 

(1) Specifically, a cysteine was introduced at 
25 amino acid 247 of gelonin (which is normally occupied by an 
aspartic acid which corresponds to the cysteine at position 
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259 in the ricin A-chain) by PGR with mutagenic primers 
GeloC-3--2 and Gg1oC--4 in conjunction with pxinBTs HINDI 
(a primer located in the vector portion of pING3734 or 
pING3825), Gelo-9 and Gelo-8. Template DNA (plNG3734) was 
5 amplified with GeloC-3-2 and HJWDlll-2 and in a concurrent 
reaction with GeIoC-4 and Gelo-9* The products of these 
reactions were mixed and amplified with the outside primers 
Gelo-8 and ifJNDIII-2. The reaction product was cut with 
EcoRl and Xhol, purified, and was inserted into plasmid 

10 • PING3825 in a three-piece ligation. The DNA sequence of 
the Gelc2A7 variant (SEQ ID NO: 102) was then verified. The 
plasmid containing the sequence encoding Gelc247 was 
designated pING373 7 and was deposited- with the American 
Type Culture Collection, 12301 Parklawn Drive, Rockville, 

15 MD 20852 on June 9, 1992 as ATCC Accession No. 69009. 

(2-3) In the same manner, a cysteine residue was 
introduced in place of the amino acid at position 248 (a 
lysine) of gelonin with the mutagenic oligonucleotides 
GeloC-1 and GeloC-2 to generate analog Gel^z^e (SEQ ID NO: 

20 103) in plasmid pING3741, and a cysteine residue was 

introduced at amino acid position 23 9 (normally occupied by 
a lysine) with primers GeloC-9 and GeloC-io to generate 
analog Gel239 (SEQ ID NO: 104) in plasmid pING3744. 

(4) Also in the same manner, a cysteine residue 
25 was introduced at amino acid 244 (a lysine) of gelonin with 

mutagenic primers GeloC-5 and GeloC-6 to generate analog 
Gelc2M (SEQ ID NO: 105) in a plasmid designated pING3736. 
This variant was prepared by PCR using plasmid plNG3734 as 
template DNA rather than pING3825. It therefore encodes 
30 the same N-terminal gelonin amino acid sequence as plasmids 
PING3737, PING3741, and pING374 4, but includes the PCR 
primer-derived 5 '-end 32 nucleotides instead of the native 
gelonin 5»-end nucleotides. 

(5) A cysteine residue was introduced in place of 
35 the amino acid (normally occupied by a lysine) at position 

10 of gelonin by a similar procedure. A cysteine was 
introduced with mutagenic primers GeloC-13 and GeloC-14 by 



wo 94/26910 



PCTAJS94/05348 



-48- 

amplifying pING3824 with araB2 (a vector primer) and GeloC- 
14, and in a separate reaction, with GeloC-13 and Gelo-11* 
--^hese-^rea-eti^n— pred-uGts— we with the 

outside primers araB2 and Gelo-11, The PGR product was cut 
5 with PstI and Ncol, purified, and cloned back into pING3825 

to generate analog Geldo (SEQ ID NO: 106) in the plasmid 
designated pING3746 and . deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, Rockville, MD 
. 20852 on June 9, 1992 as ATCC Accession No. 69008. 

10 (6) The asparagine at position 60 of gelonin was 

replaced with a cysteine residue using two mutagenic 
oligos, GeloC-15 and GeloC-16, in conjrunction with oligos 
araB2 and Gelo-il in the same manner as for the Geld© 
variant. The plasmid encoding the Gelceo (SEQ ID NO: 107) 

15 analog was designated pING3749. 

(7) A cysteine was introduced at amino acid 103 
(an isoleucine) by PGR with mutagenic primers GeloC-20 and 
GeloC-21 in conjunction with primers araB2 and HJNDIII-2. 
Template DNA {pING3733) was amplified with GeloC-21 and 

20 araB2 and separately with GeloC-20 and Hl2^I>III-2. The 

products of these reactions were mixed and amplified with 
the outside primers araB2 and HINDlll-2. The reaction 
product was cut with Wcol and BcJI, purified, and inserted 
into pING3 825 digested with Ncol and BcII. The 

25 oligonucleotides used to place a cysteine at residue 103 

also introduced an Afllll restriction site which was 
verified in the cloned gene. The plasmid containing the 
Gelcios (SEQ ID NO: 108) analog was designated pING3760. 

(8) A cysteine was introduced at position 146 
30 (an aspartic acid) by a similar strategy. Template DNA 

{pING3733) was amplified with mutagenic primer GeloC-22 and 
Gelo-14 and separately with mutagenic primer GeloC-23 and 
Gelq-19. The products of these reactions were mixed, and 
amplified with Gelo-19 and Gelo-14. The reaction product 
35 was cut with BgrJII and EcoRI, and can be inserted into 

pING3825 in a three-piece ligation. The oligonucleotides 
used to place a cysteine at residue 14 6 also introduced a 
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Ndel restriction site which can be verified in the cloned 
gene. 

(9) To introduce a cysteine at position 184 
(normally occupied by an arginine) of gelonin, template DNA 

5 (pING3733) was amplified with mutagenic primer GeloC-25 and . 

araB-2 and separately with mutagenic primer GeloC-24 and 
lfJJ^DIII-2. The products of these reactions were mixed, and 
amplified with araB2 and Gelo-14. The reaction product was 
cut with Ncol and BcII, and inserted into pING3825 

10 - previously digested with Ncol and Bell. The 
oligonucleotides used to place a cysteine at residue 184 
also introduced an Nsil restriction site which was verified 
in the cloned gene. The plasmid containing the sequence 
encoding the Gelde* (SEQ ID NO: 109) variant was designated 

15 PING3761. 

(10) A cysteine may be introduced at position 
215 (a serine) by a similar strategy. Template DNA 
(pING3733) was amplified with mutagenic primer GeloC-27 and 
araB2 and separately with mutagenic primer GeloC-26 and 

20 HJWJDIII-2. The products of these reactions were mixed, and 
amplified with araB2 and HINDJll-2. The reaction product 
was cut with BcoRI and Bell, and may be inserted into 
pING3825 in a three-piece ligation. 

(11) Another gelonin variant with a free cysteine 
25 residue was generated by replacing one of the two naturally 

occurring gelonin cysteine residues, the cysteine a 
position 50, with an alanine. Plasmid pING3824 was 
amplified with primers GeloC-17 and Gelo-11, and 
concurrently in a separate reaction with primers GeloC-19 

30 and araB2. The reaction products were mixed and amplified 
with araB2 and Gelo-11. This product was cut with Ncol and 
Bglll, and cloned back into the vector portion of pING3825 
to generate pING3747 (ATCC 69101) . This analog was 
designated Gel^socc**) contains a cysteine available for 

35 disulfide bonding at amino acid position 44, Non-cysteine 
residues, other than alanine, which do not disrupt the 
activity of gelonin, also may be inserted at position 50 in 
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natural gelonin in order to generate a gelonin analog with 
a single cysteine at position 44. 

(12) A gelonin variant in vhich the natural 
cysteine at position 44 was changed to alanine was 

5 constructed by amplifying pING3733 using the mutagenic 

oligonucleotides GeloC-28 and GeloC-29 in conjunction with 
primers araB2 and HJNDIII-2 . The amplified DNA was cut 
with NcoT and Bgrlll and cloned into a gelonin vector ^ 
generating pING3756* That variant generated was designated 
10 GelA4MC50)- Non-cysteine residues, other than alanine, which 
do not disrupt gelonin activity, also may be inserted at 
position 44 in order to generate a gelonin analog with a 
single cysteine at position 50. 

(13) ' A gelonin variant in which both the 
15 cysteine at position 44 and the cysteine at position 50 of 

gelonin were changed to alanine residues was constructed by 
overlap PCR of pING3824 using the mutagenic 
oligonucleotides GeloC-17 and GeloC-18 in conjunction with 
primers araB2 and Gelo-11. This analog, like the native 

20 gelonin protein, has no cysteine residues available for 

conjugation- The plasmid encoding the analog was 
designated pING3750. The analog generated was designated 
GelA**A5o (SEQ ID NO: 101). Non-cysteine residues, other than 
alanine, which do not disrupt gelonin activity, also may be 

25 substituted at both positions 44 and 50 in order to 

generate a gelonin analog with no cysteine residues. 

(14) The triple mutant Geloninc247A44A5o (SEQ ID NO: 
111) was constructed from the plasmids pING3824, pING3750 
and pING3737. This variant contains an introduced cysteine 

3 0 at position 247 while both of the naturally occurring 
cysteine residues at positions 44 and 50 have been replaced 
with alanine. The analog is desirable because, in this 
analog, disulfide linkage to an antibody is only assured at 
a single cysteine residue. Plasmid pING3824 was cut with 

35 NcoJ and Xhol and the vector fragment was purified in an 
agarose gel. pING3750 was cut with NcoT and EcdRl and 
pING3737 was cut with .EcoRI and Xhol. The NcoI-£coRI 
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fragment encodes the alanines at positions 44 and 50 while 
the j^coRI-XhoI fragment encodes the cysteine at position 
247. Each of these fragments was purified ~and~iigated to 
the Ncol to XhoJ vector fragment. The resulting plasmid is 
5 named pING3752. 

(15) The triple mutant GelonincioA44A5o (SEQ ID NO: 
110) was also constructed by assembly from previously 
assembled plasmids. In this case, pING3746 was cut with 
Pstl and i^col, while pING3750 was cut with Ncol and XhoJ. 

10 ' Each of the insert fragments were purified by 
electrophoresis in ah agarose gel, and the fragments were 
ligated into a Pstl and Xhol digested vector fragment. The 
resulting vector was designated pING3753. The GelcioAAAASo 
analog has been referred to previously as Gelcioc44AC50A (see, 

15 e.g., co-owned, co-pending U.S. Patent Application Serial 

No. 07/988,430, incorporated by reference herein) • 

Each of the gelonin variants constructed was 
transformed into E. coli strain E104. Upon induction of 
bacterial cultures with arabinose, gelonin polypeptide 

20 could be detected in the culture supernatants with gelonin- 

specific antibodies. There were no significant differences 
detected in the expression levels of gelonin from plasmids 
pING3734 and pING3825, or in the levels from any of the 
gelonin variants. Each protein was produced in coli at 

25 levels of approximately 1 g/1. 

Example 4 

Reticulocyte Lvsate Assay 

The ability of gelonin and recoanbinant gelonin 
analogs to inhibit protein synthesis in vitro was tested 

30 using a reticulocyte lysate assay (RLA) described in Press 

et al., Immunol. Letters, 14:37-41 (1986). The assay 
measures the inhibition of protein synthesis in a cell-free 
system using endogenous globin mRNA from a rabbit red blood 
cell lysate. Decreased incorporation of tritiated leucine 

35 (^H-Leu) was measured as a function of toxin concentration. 

Serial log dilutions of standard toxin (the 30 kD form of 
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ricin A-chain, abbreviated as RTA 30) , native gelonin, 
recombinant gelonin (rGelonin or rGel) and gelonin analogs 
were tested over a range of l /ig/ml to 1 pg/ml. Samples 
were tested in triplicate, prepared on ice, incubated for 
5 30 minutes at 37 *C, and then counted on an Inotec Trace 96 
cascade ionization counter. By comparison with an 
uninhibited sample, the picomolar concentration of toxin 
(pM) which corresponds to 50% inhibition of protein 
synthesis (IC50) was calculated. As is shown in Table 1 
10 below, recombinant gelonin and most of its analogs exhibit 
activity in the RliA comparable to that of native gelonin. 
For some of the analogs (such as Gelc239)L, RLA activity was 
diminished. 

Table 1 

15 Toxin IC^olpMl 

RTA 30 2.5 
Gelonin 15 
rGelonin 11 
Gelcio 60 

20 GelA50(C4«) 20 

GelAA*(C50) 47 
Gelc6o 26 
Gelc239 955 
Gelc2AA 32 

25 Gelc247 12 

Gelc248 4 7 

Gel^AAASO 16 
GelcioAA450A 7 
Gelc247AA«A50 2 0 
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Example 5 

Himan-Engineered Antibodies 

For- Construct ion--Of—I-minunot-Gxi-ns - — — 

Antibodies for use in constructing immunotoxins 
5 according to the present invention may be humanized 
antibodies, such as he3 and fragments thereof which display 
increased content of human amino acids and a high affinity 
for human CDS cell differentiation marker. he3 is a 
humanized form of a mouse H65 antibody (H65 is a preferred 

10 monoclonal antibody for use in preparing humanized 
antibodies according to the present . invention and is 
produced by hybridoma cell line XNMLY-H65 (H65) deposited 
with the American Type Culture ColleQtion in Rockville, 
Maryland (A.T.C.C.) and given the Accession No. HB9286) . 

15 Humanized antibodies for use in the present 

invention are prepared as disclosed herein using the 
humanized forms of the murine H65 antibody in which both 
low and moderate risk changes described below were made in 
both variable regions. Such humanized antibodies should 

20 have less immunogenicity and have therapeutic utility in 
the treatment of autoimmune diseases in humans. For 
example, because of their increased affinity over existing 
therapeutic monoclonal antibodies such as H65, he3 
antibodies of the invention may be administered in lower 

25 doses than H65 anti-CD5 antibodies in order to obtain the 

same therapeutic effect. 

Humanized antibodies, such as he3, are useful in 
reducing the immunogenicity of foreign antibodies and also 
results in increased potency when used as a portion of an 

30 immunoconjugate. 

Construction of humanized antibody variable 
domains according to the present invention and for use as 
components of immunotoxins may be based on a method which 
includes the steps of: (1) identification of the amino acid 

35 residues of an antibody variable domain which may be 

modified without diminishing the native affinity of the 
domain for antigen while reducing its immunogenicity with 
respect to a heterologous species; and (2) the preparation 
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of antibody variable domains having modifications at the 
identified residues which are useful for administration to 
heterologous species* The methods of the invention are 
based on a model of the antibody variable domain described 
5 herein which predicts the involvement of each amino acid in 
the structure of the domain. 

Unlike other methods for humanization of 
antibodies, which advocate replacement of the entire 
classical antibody framework regions with those from a 

10 human antibody, the methods described herein introduce . 
human residues into the variable domain of an antibody only 
in positions which are not critical for antigen-binding 
activity and which are likely to be exposed to 
immunogenicity-stimulating factors. The present methods 

15 are designed to retain sufficient natural internal 

. structure of the variable domain so that the antigen- 
binding capacity of the modified domain is not diminished 
in comparison to the natural domain. 

The human consensus sequences in which moderate 

20 risk residues are converted from mouse residues to human 

residues are represented in Figures IDA and lOB as lines 
labelled hKl (i.e., subgroup 1 of the human kappa chain) 
and hH3 (i.e., subgroup 3 of the human heavy chain). 
Symbols in the figures for conservation and for risk in 

25 "bind" and "bury" lines are follows: 

First Symbol in Pair (Ligand Binding) 
+ Little or not direct influence on antigen- 
binding loops, low risk if substituted 
^ Indirectly involved in antigen-binding loop 
30 Structure, moderate risk if changed 

Directly involved in antigen-binding loop 
conformation or antigen contact, great risk 
if modified 
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Second Symbol in Pair (Immunogenicity/Struture) 



+ Highly accessible to solvent, high 

immunogenicity, low risk if substituted 
« Partially buried, moderate immunogenicity, 
5 moderate risk if altered 



Completely buried in subunit's hydrophobic 
core, low immunogenicity , high risk if 
changed 

= Completely buried in interface between 
10 ' subunits, low immunogenicity, high risk if 

modified 

Significance of Pairs 
++ Low risk 

Highly accessible to solvent 
15 and high immunogenicity, but 

little or no effect on specific 
antigen binding 
+ , «>o Moderate Risk 

Slight immunogenicity or indirect 
20 involvment with antigen binding 

any - or = High risk 

Buried within the subunit core/ 
interface or strongly involved in 
antigen binding, but little immunogenic 
25 potential 

In the line labelled "mod", a dot (.) represents 
a residue which may be mutated from "mouse" to "human" at 
moderate risk. There are 29 such moderate risk positions. 

The mouse residue matches the human consensus 
30 residue more than 50% of the time at 131 positions (102 

positions match 90%-100% and 29 positions match 50% to 
90%). These positions were not changed. 

The lines labelled M/H in Figures 12A and 12B 
indicate the 91 positions which differed significantly 
35 between the mouse and human sequences {i.e., where the 
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hviman sequences have the jnouse residue less than 50% of the 
time). Moderate risk positions, designated m in the M/H 
line, were -kept- -^'mouse"; whereas those designated H or h 
were changed to human. The 25 low risk positions which 
5 were already human-like or which were previously humanized 
(as described supra in Example 2) are designated •» in 
the M/H line. Finally, the 54 high risk positions in which 
the mouse and human residues did not match are designated 
M and are kept "mouse". 
0 Fifteen differences occur at moderate risk 

positions at which the mouse and hixman sequences differ. 
At ten of those positions (designated "Ul* on the M/H lines 
of Figure 6) the mouse residue aligns with a human 
consensus amino acid which is highly conserved. Therefore, 
5 the mouse residue at that position is identified as one to 

be changed to the conserved human- residue. 

At moderate risk positions (designated "m") in 
which the mouse and the human sequences differ, the mouse 
residue aligns with a hvunan consensus amino acid which is 
0 moderately conserved. However, since the mouse residue is 

found at that position in other actual sequences of human 
antibodies [See Kabat, et a2 . , sequences of Proteins of 
Immunoglobulin Interest, Fourth Edition, U.S. Department of 
Health and Human Services, Public Health Service, National 
Institutes of Health (1987)] the positions are identified 
as ones to be kept "mouse." Although there are no such 
positions in this particular sequence, such positions may 
occur in other antibodies. 

At four moderate risk positions (designated "h"), 
the mouse residue aligns with a human consensus amino acid 
which is moderately conserved but the mouse residue is not 
found at that position in an actual hiiman antibody sequence 
in Kabat, et al . Sequences of Proteins of Immunoglobulin 
Interest^ supra. Therefore, that position is identified as 
ones to be changed to "htunan." 

At one moderate risk position (designated "m") in 
which the mouse and human secjuences differ, the mouse 
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residue aligns with a human consensus amino acid which is 
poorly conserved. Therefore, that position is identified 
as one to be "Jcept" "mouse 

A. Assembly Of Moderate Risk 
5 Heavy Chain Expression Vectors 

The humanized H65 heavy chain containing the 
moderate risk residues was assembled by the following 
strategy. The moderate-risk expression, vector Was 
assembled from intermediate vectors. The six 

10 oligonucleotide sequences (oligos) , disclosed in Figure 12 

and labelled HUH-Gll, HUH-G12, HUH-G3, HUH-G4 , HUH-G5, and 
HUH-G6 (the sequences of HUH-Gll and HUH-G12 are set out in 
SEQ ID Nos. 131 and 132 .and HUH-G3, HUH-G4, HUH-G5, and 
HXJH-G6 are set out in SEQ ID NOS: 137-14 0) were assembled 

15 by PGR. Oligonucleotides containing the synthetic 

hximanized antibody gene were mixed in pairs (HUH-Gll + 
HUH-G12, HUH-G3 + HUH-G4, and HUH-G5 + HUH-G6) in a 100 /zl 
reaction with 1 /ig of each DNA and filled in as described 
above. A portion of each reaction product was mixed in 

20 pairs (HUH-Gll, 12 + HUH-G3, 4; HUH-G3 , 4 + HUH-G5, 6), 2.5 

U Taq was added and samples were reincubated as described 
above. The V-J region was assembled by mixing equal 
amounts of the HUH-Gll, 12, 3, 4 reaction product with the 
HUH-G3, 4, 5, 6 product, followed by PGR with 0.5 ug of 

25 primers H65G-2S and H65-G2 as described above. The 

reaction product was cut with Sail and Bs tEII and cloned 
into the expression vector, similar to that described for 
heavy chain in Robinson et aJ., Hum. Antibod. Hybridomas 
2:84 (1991), generating pING46l7. That plasmid was 

30 sequenced with Sequenase (USB, Cleveland), revealing that 

two residues were altered (a G-A at position 288 arid a A-T 
at position 312, numbered from the beginning of the leader 
sequence) * The correct variable region was restored by 
substitution of this region from pING4612, generating the 

35 expected V-region sequence in pING4619. 

An intermediate vector 'containing the other 
moderate-risk changes was constructed by PGR* assembly of 
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the oligos HUH-G13, HUH-G14, HUH-G15, and HUH-GIG (Fig. 11 
and SEQ ID Nos: 133-136). Oligos HUH-G13 + HUH-G14 and 
— -HtJH-S-15-- + — ^HUH-e-16— were-~in-ixed-"and -f iileid" -±n — wxthr- Vent 
polymerase (New England Biotabs) in a reaction containing 
10 mM KCl, 20 mM TRIS pH 8.8, 10 mM (}m^)2S02, 2mM MgSO^, 
0.1% Triton X-100, 100 ng/ml BSA, 200 ^M of each dNTP, and 
2 units of Vent polymerase in a total volume of 100 /il» 
The reaction mix was incubated at 94*^0 for 1 minute, 
followed by 2 minutes at 50^C and 20 minutes at 72«*C. The 
reaction products (4 0 ^1) were mixed and amplified with the 
oligonucleotides H65-G13 and H65-G2 with Vent polymerase in 
the same reaction buffer and amplified Sor 25 cycles with 
denaturation at 94**C for 1 minute, annealing at 50^C for 2 
minutes and polymerization at 72 ®C for 3 minutes. The 
reaction product was treated with T4 polymerase and then 
digested with^AccI. The 274 base pair (bp) fragment was 
purified on an agarose gel and ligated along with the 141 
bp Sail to AccI fragment from pIKG4 619 into pUClS cut with 
Sail and Smal to generate pING4 620. pING4 620 contains the 
entire signal sequence, V-region, and J-region of the 
moderate-risk H65 heavy chain - 

The final expression vector for the moderate-risk 
H65 heavy chain, pING4621, was assembled by cloning the 
Sail to BstEIl fragment from pING4 620 into the same 
expression vector described above. 

B. Assembly Of Moderate-Risk 

Light Chain Expression Vectors 

The moderate-risk humanized V- and J-segments of 

the light chain were assembled from six oligonucleotides, 

$H65K-1 (SEQ ID NO: 117), HUH-K7 (SEQ ID KO: 119), HUH-K6 

(SEQ ID NO: 118), HUH-K8 (SEQ ID NO: 120), HUH-K4 (SEQ ID 

NO: 121 and HUH-K5 (SEQ ID NO: 122). The oligonucleotides 

were amplified with PGR primers K65K-2S and JKl-lTijidlll . 

Oligonucleotides containing the synthetic humanized 

antibody gene were mixed in pairs ($H65-K1 + HUH-K7, HUH-K6 

+ HUH-K4 + HUH-K5) and incubated with Vent polymerase as 

described for the moderate-risk heavy chain. A portion of 
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each reaction product (40 ul) was mixed in pairs 
($H65H-Kl/mJH-K7 + HUH-K6 , 8; HUH~K6, 8 + HUH-K4 , 5) and 
filled in as above . The Tight chain "gene was then 
assembled by amplifying the full length gene with the PCR 
5 primers H65K-2S and JKl-lfindlll with Vent polymerase for 25 
cycles as outlined above. The assembled V/J region was cut 
with Sail and HindlJl, purified by electrophoresis on an 
agarose gel, and assembled into a light chain antibody 
expression vector, pING4630. 

10 Ex^unple 6 

Transfection Of he3 Genes And 
Purification Of Expression Products 

A. Stable Transfection Of Mouse Lymphoid 

Cells For The Production Of he3 Antibody 

15 The cell line Sp2/0 (American Type Culture 

Collection Accession No. CRL1581) was grown in Dulbecco's 
Modified Eagle Medium plus 4.5 g/1 glucose (DMEM, Gibco) 
plus 10% fetal bovine serum. Media were supplemented with 
glutamine/penicillin/streptomycin (Irvine Scientific, 

20 Irvine, California) . 

The electr operation method of Potter, H. , et al . , 
Proc. Natl. Acad, Sci., USA, 81:7161 (1984) Was used. 
After transfection, cells were allowed to recover in 
complete DMEM for 24-48 hours, and then seeded at 10,000 to 

25 50,O00 cells per well in 96-well culture plates in the 

presence of selective medium. Histidinol (Sigma) selection 
was at 1.71 Mg/mlf and mycophenolic acid (Calbiochem) was 
at 6 /zg/ml plus 0.25 mg/ml xanthine (Sigma). The 
electroporation technique gave a transfection frequency of 

30 1-10 X 10'^ for the Sp2/0 cells. 

The he3 light chain expression plasmid plNG4630 
was linearized by digestion with Pvul restriction 
endcnuclease and transfected into Sp2/0 cells, giving 
mycophenolic acid - resistant clones which were Screened 

35 for light chain synthesis. 

Four of the top-producing subclones, secreting 
4.9^7.5 ^g/ml were combined into two pools (2 clones/pool) 
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and each pool was transfected with plasmid pING42621, 
containing the moderate-risk heavy chain. After selection 
with histidinol, the clones producing the most light plus 
heavy chain, Sp2/0-4630 and 4621 Clones C1705 and C1718, 
5 secreted antibody at approximately 15 and 22 /zg/ul 

respectively in the presence of 10"' M dexamethasone in an 
overgrown culture in a T2 5 flask. Clone C1718 was 
deposited with the American Type Culture Collection, 1230 
Parklawn Drive, Rockville, Maryland, 20852 on December 1, 
10 1992 as ATCC HB 11206. The best producer is a subclone of 

Clone C1718 which is produced by limiting dilution 
subcloning of Clone C1718. ^ 

B. Purification Of he3 Antibody 
Secreted In Tissue Culture 

15 Sp2/0-4 630 + 4 621 Clone C1705cells were grown in 

culture medium HBlOl (Hana Biologies) + 1% Fetal Bovine 
SerxuD, supplemented with 10 mM HEPES, Ix Glutamine-Pen- 
Strep (Irvine Scientific #9316) • The spent medium was 
centrifuged at about 5,000 x g for 20 minutes- The 

20 antibody level was measured by ELISA. Approximately 200 ml 

of cell culture supernatant was loaded onto a 2 ml Protein 
A-column (Sigma Chemicals) , equilibrated with PBS (buffer 
0.15 M NaCl, 5 mM sodium phosphate, 1 mM potassium 
phosphate, buffer pH 7.2). The he3 antibody was eluted 

25 with a step pH gradient (pH 5.5, 4.5 and 2.5). A fraction 

containing he3 antibody (9% yield) but not bovine antibody, 
was neutralized with 1 M Tris pH 8.5, and then concentrated 
10-fold by Centricon 30 (Amicon) diluted 10-fold with PBS, 
reconcentrated 10-fold by Centricon 30, diluted 10-fold 

30 with PBS, and finally reconcentrated 10-fold. The antibody 

was stored in 0.25 ml aliguots at -20^ C. 

C. Affinitv Keasureiugnts Of he3 TqG For CDS 

The affinity of he3 IgG for CDS was determined 
using Molt-4M cells, which express CDS on their surface, 
35 and I^^^-labeled chimeric H65 IgG in a competitive binding 
assay. Culture supernatants from Clone C1705 and C1718 and 
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purified IgG from C1705 were used as the sources of he3 
IgG, 



For this assay, 20 Mg of "cYiieric" H65 TgG (cH65 
IgG) vas iodinated by exposure to lOO 
5 lactoperoxidase-glucose oxidase immobilized beads 

(Enzymobeads, BioRad) , 100 ^1 of PBS, 1.0 mCi l^" (Amersham, 
IMS30) , 50 III of 55 mM b-D-glucose for 45 minutes at 23**C. 
The reaction was quenched by the addition of 20 fil of 105 
mM sodium metabisulf ite and 120 mM potassixim iodine 
10 followed by centrif ugation for 1 minute to pellet the 

beads. ^"l-cH65 IgG was purified by gel filtration using 7 
mis of sephadex G25^ using PBS (137 mM NaCl, 1.47 mM KH2PO4, 
8*1 mM Na2HP0^, 2.68 mM KCl at pH 7.2-7.4) plus 0.1% BSA. 
^"l-cH65 IgG recovery and specific activity were determined 
15 by TCA precipitation. 

Competitive binding was performed as follows: 
100 Ml of Molt-4M cells were washed two times in ice-cold 
DHB binding buffer (Dubellco's modified Eagle's medium 
(Gibco, 320-1965PJ), 1.0% BSA and 10 mM Hepes at pH 7.2. 
20 -7.4). Cells were resuspended in the same buffer, plated 

into 96 v-bottomed wells (Costar) at 3 x 10^ cells per well 
and pelleted at 4''C by centrif ugation for 5 min at 1,000 
rpm using a Beckman JS 4.2 rotor; 50 /il of 2X-concentrated 
0.1 nM ^2^I-cH65 IgG in DHB was then added to each well and 
25 competed with 50 /il of 2X - concentrated cH65 IgG or 

humanized antibody in DHB at final antibody concentrations 
from 100 nM to 0.0017 nM. Humanized antibody. was obtained 
from culture supernatants of Sp2/0 clone C1718 which 
expresses he3 IgG. The concentration of the antibody in 
30 the supernatants was established by ELISA using a chimeric 
antibody as a standard. The concentration of the antibody 
in the purified preparation was determined by binding was 
allowed to proceed at 4*C for 5 hrs and was terminated by 
washing cells three times with 200 /il of DHB binding buffer 
35 by centrif ugation for 5 min at 1,000 rpm. All buffers and 

operations were at 4^C. Radioactivity was determined by 
solubilizirig cells in 100 ^1 of i.O M NaOH and counting in 
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a Cobra II auto gamma counter (Packard) . Data from binding 
experiments were analyzed by the weighted nonlinear least 
squares curve fitting program, MacLigand, a Macintosh 
version of the computer program "Ligand" from Munson, 
5 Analyt. Biochem.^ 107:220 (1980). Objective statistical 

criteria (F, test^ extra siun squares principle) were used 
to evaluate goodness of fit and for discriminating between 
models • Nonspecific binding was treated as a parameter 
subject to error and was fitted simultaneously with other 
10 parameters. 

Figure 11, provides data showing relative 
binding of he3 and CH65 to CD5 on 3ftplt-4M cells in a 
competition binding assay • These results demonstrate that 
the moderate-risk changes made in he3 IgG result in an 
15 antibody with a higher affinity than the chimeric mouse- 

human form of this antibody (cH65) for its target, CDS. 

Example 7 

Preparation of Gelonin Immunoconiuaates 

Gelonin analogs of the invention were variously 

20 conjugated to murine (ATCC HB9286) and chimeric H65 (cH65) " 

antibody, cH65 antibody domains (including cFab, cFab' and 
cF(ab')2 fragments), and humanized antibodies and antibody 
domains, all of which are specifically reactive with the 
human T cell determinant CD5. H65 antibody was prepared 

25 and purified by methods described in U.S. Patent 
Application Serial No. 07/306,433, supra and International 
Publication No. WO 89/06968, supra. Chimeric H65 antibody 
was prepared according to methods similar to those 
described in Robinson et al.. Human Antibodies and 

30 Hybridojoas^ 2:84-93 (1991), incorporated by reference 
herein. Chimeric H65 Fab, Fab', and F(ab')2 proteins were 
prepared as described in Better, et al., Proc. Nat. Acad. 
Sci. (USA), 90: 457-461 (1993), incorporated by reference 
herein. Finally, he3 humanized antibodies were prepared 

35 according to the procedures described in U.S. Patent 



M'O 94/26910 



PCT/US94/05348 



-63- 

Application Serial No. 07/808,464, incorporated by 
reference herein. 

A. Conjugation To H65 Antibodies 

To expose a reactive sulfhydryl, the unpaired 
5 cysteine residues of the gelonin analogs were first reduced 

by incubation with 0.1 to 2 mM DTT (30-60 minutes at room 
temperature) , and then were desalted by size-exclusion 
chromatography. 

Specifically, the Gelcz^e analog (3.8 mg/ml) was 

10 treated with 2 xoM DTT for 60 minutes in 0.1 M Naphosphate, 
0.25 M NaCl, pH 7.5 buffer. The Gelca^^ variant (7.6 mg/ml) 
was treated with 2 mM DTT for 30 minutes in 0.1 M 
Naphosphate, 0.25 M NaCl, pH 7.5 buffer. The Gelc2A7 analog 
(4 mg/ml) was treated with 2 mM DTT for 30 minutes in 0.1 

15 M Naphosphate, 0.5 M NaCl, pH 7.5 buffer with 0.5 mM EDTA. 

The Gelc239 variant (3.2 mg/ml) was treated with 2 mM DTT for 
3 0 minutes in 0.1 m Naphosphate, 0.5 M NaCl, pH 7.5 buffer 
with 0.5 mM EDTA. The Gel^o(c*o analog (4.2 mg/ml) was 
treated with 0.1 mM DTT for 30 minutes in 0.1 M 

20 Naphosphate, 0.1 M NaCl, pH 7.5 buffer with 0.5 mM EDTA. 

Lastly, the Geldo variant (3.1 mg/ml) was treated with 1 mM 
DTT for 20 minutes in 0.1 M Naphosphate, 0.1 M NaCl, pH 7.5 
buffer with 1 mM EDTA. 

The presence of a free sulfhydryl was verified by 

25. reaction with DTNB and the average value obtained was 1.4 
± 0.65 SH/molecule. No free thiols were detected in the 
absence of reduction. 

H65 antibody and chimeric H65 antibody were 
chemically modified with the hindered linker 5-methyl-2- 

30 iminothiolane (M2IT) at lysine residues to introduce a 

reactive sulfhydryl group as described in Goff et al., 
Bioconjugate Chem., 1:381-386 (1990) and co-owned Carroll 
et al., U.S. Patent No. 5,093,475, incorporated by 
reference herein. 

35 Specifically, for conjugation with Gelc2*8 and 

Gelc264f murine H65 antibody at 4 mg/mL was derivitized with 
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18x M2IT and 2.5 mM DTNB in 25 inM TEOA, 150 mM NaCl, pH 8 
buffer for 1 hour at 23 *C. The reaction gave i.9 linkers 
-per— antibody_as det.ermined-.,by_aTNB assay- 

For conjugation with Gelca*? ^nd Gelc239, H65 
5 antibody at 4.7 mg/mL was derivitized with 2 Ox M2IT and 2.5 

mM DTNB in 25 mM TEOA 150 mM NaCl, pH 8 buffer for 50 
minutes at 23 *C. The reaction gave 1.6 linkers per 
antibody as determined by DTNB assay. 

Before reaction with Gel^soccA*)/ H65 antibody at 5.8 
10 mg/mL was derivitized with 20x in2IT and 2.5 mM DTNB in 25 

mM TEOA, 150 mM NaCl, pH 8 buffer for 30 minutes at 23 *C. 
The reaction gave 1.5 linkers per antibody as determined by 
DTNB assay. 

For conjugation with Geldo/ H65 antibody at 2.2 
15 mg/mL was derivitized with lOx m2IT and 2.5 inM DTNB in 25 

mM TEOA, 150 loM NaCl, pH 8 buffer for 1 hour at 23*C. The 
reaction gave 1.4 li]ikers per antibody as determined by 
DTNB assay. 

Chimeric H65 antibody was prepared for 
20 conjugation in a similar manner to murine H65 antibody. 

.Two methods were initially compared for their 
effectiveness in preparing immunoconjugates with 
recombinant gelonin. First, the native disulfide bond in 
recombinant gelonin was reduced by the addition of 2inM DTT 
25 at room temperature for 30 minutes. The reduced gelonin 

was recovered by size-exclusion chromatography on a coliiinn 
of Sephadex GF-05LS and assayed for the presence of free 
sulfhydryls by the DTNB assay. 1.4 free SH groups were 
detected. This reduced gelonin was then reacted with H65- 
30 (M2IT)-S-S-TNB (1.8 TNB groups/H65) . Under these 

experimental conditions, little or no conjugate was 
prepared between reduced gelonin and thiol-activated H65 
antibody. 

In contrast, when both the recombinant gelonin 
35 and the H65 antibody were first derivitized with the 

crosslinker M2IT (creating gelonin-- (M2IT)-SH and H65- 
(M21T)-S-S-TNB) and then mixed together, H65- (M2IT) -S-S- 
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(M2IT)-gelonin conjugate was prepared in good yield 
(toxin/antibody ratio of 1.6). The starting materials for 
this conjugation (gelonin- (M2IT) -SH and H65- (M2IT) -S-S-TNB) 
contained linker /protein ratios of 1.2 and 1.4, 
respectively. Native gelonin was derivatized in a similar 
manner prior to conjugation to murine or chimeric H65 
antibody. 

The reduced gelonin analogs were mixed with H65- 
(M21T) -S-S-TNB to allow conjugation. The following 
: conjugation reactions were set up for each analog: 23 mg 
(in 7.2 ml) of H65-M2IT-TNB were mixed with a 5-fold molar 
excess of Gelcj*, (23 mg in 6 ml) for 2 hours at room 
temperature, then for 18 hours overnight at 4*C; 23 mg (in 
7.3 ml) of H65-m2IT-TNB were mixed with a 5-fold molar 
15 excess of Gelcju (23 mg in 3 ml) for 3 hours at room 
temperature, then for 18 hours overnight at 4*C; 9 mg (in 
2.8 mL) of H65-m2IT-TNB were mixed with a 5-fold molar 
excess of Gelc2„ (9 mg in 2.25 mL) for 2 hours at room 
temperature, then for 5 nights at 4*0; 9 mg (in 2.8 mL) of 
20 H65-m2IT-TNB were mixed with a 5-fold molar excess of Gel„„ 
(9mg in 2.6 mL) for 2 hours at room temperature, then at 
4*C for. 3. days; 12 mg (in 1.9 mL) of H65-m2IT-TNB were 
mixed with a 5.6-fold molar excess of Gel^5o,c4», (13.44 mg in 
3.2 mL) for 4.5 hours at room temperature, then 4*C 
25 overnight; and 11 mg of H65-m2IT-TNB were mixed with a 5- 
fold molar excess of Geldo (11 mg in 3.5 mL) for 4 hours at 
room temperature, then at 4*C overnight. 

Following conjugation, unreacted M2IT linkers on 
the antibody were quenched with 1:1 mole cysteamine to 
linker for 15 minutes at room temperature. The quenched 
reaction solution was then loaded onto a gel filtration 
column [Sephadex G-150 (Pharmacia) in the case of Gelc2„, 
G®lc2«7» Gelc24» and Gelcjj, and an AcA-44 coliimn (IBP 
Biotecnics, France) in the case of Gel^^^c**) and Gel„o3 . The 
reactions were run over the gel filtration columns and 
eluted with 10 mM Tris, 0.15M NaCl pH 7. The first peak 
off each column was loaded onto Blue Toyopearl® resin 



30 



35 
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(TosoHaas, Philadelphia, Pennsylvania) in 10 mM Triis, 30 mM 
NaCl, pH 7 and the product vas eluted with 10 mM Tris, 0.5 

"W^alCXr pH~7T5. ' 

Samples of the final conjvigation products were 
5 run on 5% non-reduced SDS PAGE, Coomassie stained and 

scanned with a Shimadzu laser densitometer to quantitate 
the number of toxins per antibody (T/A ratio) . The yield 
of final product for each analog conjugate was as follows: 
-Gelc2«8f ™g with a T/A ration of 1.6; Gelc2*7# 1.1 mg with 

10 a T/A ratio of 1; Gelc2A*r 4.5 mgs with a T/A ratio of 1.46; 

Gelc239r 2*9 mg with a T/A ratio of 2.4; GcIasocw*)/ 7.3 mg 
with a T/A ratio of 1.22; and Gelcio, 6.2 "fiig with a T/A ratio 
of 1-37. Conjugation efficiency {i.e., conversion of free 
antibody to immunocon jugate) was significantly greater 

15 (-80%) for some analogs (Geldo* GelA50(c*«)/ <^elc239# Gelc2*7# and 

Gelc2*B) than for others (-10%, Gelc244) • 

B. Gelonin Immunocon jugates With 

chimeric And Humanized Antibodies 

Analogs Gelc2*7 and GelAsocc**) were also conjugated 

20 to various chimeric [cH65Fab, cH65Fab' and cH65F(ab»)2] and 

"human engineered" [hel Fab, he2-Fab, he3-Fab, hel Fab* and 
hel F(ab')2] antibody fragments. Chimeric H65 antibody 
fragments may be prepared according to the methods 
described in International Publication No. WO 89/00999, 

25 supra. The DNA sequences encoding the variable regions of 
H65 antibody fragments that were human engineered 
(referring to the replacement of selected murine-encoded 
amino acids to make the H65 antibody sequences less 
immunogenic to humans) according to the methods described 

30 above in Example 5, are set out in SEQ ID NO: 69 (the 

kappa chain of hel and he2) , SEQ ID NO: 70 (the gamma chain 
of hel), SEQ ID NO: 71 (the gamma chain of he2 and he3) and 
SEQ ID NO: 72 (the kappa chain of he3) 

The chimeric H65 antibody fragments were 

35 conjugated to the Gelc247 analog in the same manner as 
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20 



described below for conjugation of human engineered Fab and 
Fab' fragments toGelcj*, and Gel^jojc**)- 



(i) hel Fab-Gelr -^? 

The hel Fab was diaiyzed into 25 mM TEOA buffer, 
5 250 mM NaCl, pH 8 and then concentrated to 6,8 mg/mL prior 
to derivitization with the M2IT crosslinker. For the 
linker reaction, M2IT was used at 20-fold molar excess, in 
the presence of 2.5 mM DTNB. The reaction was allowed to 
-proceed for 30 minutes at room temperature, then desalted 
10 on GF05 (gel filtration resin) and equilibrated in 0.1 M Na 
phosphate, 0.2M NaCl, pH 7.5. A linker number of 1.8 
linkers per Fab was calculated based dn the DTNB assay. 
The hel Fab-M2IT-TNB was concentrated to 3.7 mg/mL prior to 
conjugation with Gelc2„. 

Gelc2„ at 12.8 mg/mL in 10 mM Na phosphate, 0.3M 
NaCl, was treated with 1 mM DTT, 0.5 mM EDTA for 20 minutes 
at room temperature to expose a reactive sulfhydryl for 
conjugation and then was desalted on GF05 and equilibrated 
in 0.1 M Na phosphate, 0.2 M NaCl, pH 7.5. Free thiol 
content was determined to be 0.74 moles of free SH per mole 
. of Gelc2„ using the DTNB assay. The gelonin was • 
concentrated to 8.3 mg/mL prior to conjugation with 
activated antibody. 

The conjugation reaction between the free thiol 
!5 on Gelc2*7 and the derivitized hel Fab-M2IT-TNB, conditions 
were as follows. A 5-fold excess of the gelonin analog was 
added to activated hel Fab-M21T-TNB (both proteins were in 
O.lM Na phosphate, 0.2M NaCl, pH7.5) and the reaction 
mixture was incubated for 3.5 hours at room temperature and 
0 then overnight at 4C. Following conjugation, untreated 
M2IT linkers were quenched with 1:1 mole cysteamine to 
linker for 15 minutes at room temperature. The quenched 
reaction solution was loaded onto a gel filtration column 
(G-75) equilibrated with 10 mM Tris, 150 mM NaCl, pH 7. 
5 The first peak off this column was diluted to 30 mM NaCl 
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with 10 mM Tris, pH7 and loaded on Blue Toyopearl®. The 
product was eluted with 10 mM Tris, 0.5 M NaCl, pH 7.5, 

(ii) hel Fab^-Geln ,., ^ 

Similarly, the H65 hel Fab' fragment was dialyzed 
5 into 25 mM TEOA buffer, 400 inM NaCl, pH 8 at 2.9 rog/mL 

prior to derivitization with the M2IT crosslinXer . For the 
linker reaction, M2IT was used at 20-fold molar excess, in 
-the presence of 2*5 mM DTNB. The reaction was allowed to 
proceed for 1 hour at room temperature then it was desalted 

10 on GF05 (gel filtration resin) and equilibrated in 0.1 M Na 

phosphate, 0.2 M NaCl, pH 7.5. A linker number of 1.6 
linkers per Fab* was calculated based on the DTNB assay. 
The hel Fab • -M2IT-TNB was concentrated to 3.7 mg/mL prior 
to conjugation with Gelc247 

15 The Gelc247 at 77 mg/mL was diluted with 10 mM Na 

phosphate, 0.1 M NaCl to a concentration of 5 mg/mL, 
treated with 1 mM DTT, 0.5 mM EDTA for 3 0 minutes at room 
temperature to expose a free thiol for conjugation and then 
was desalted on GF05 and equilibrated in 0.1 M Na 

20 phosphate, 0.2 M NaCl, pH 7.5. Free thiol content was 

determined to be 1.48 moles of free SH per mole of Gelc247 
using the DTNB assay. The Gelc2«7 was concentrated to 10 
mg/mL prior to conjugation with activated hel Fab»-M2IT- 
TNB. 

25 For the reaction between the free thiol on Gelc2«7 

and the derivitized hel Fab'-M2IT-TNB, conditions were as 
follows. A 5.7-fold molar excess of gelonin was added to 
activated hel Fab»-M2IT--TNB and the final salt 
concentration was adjusted to 0.25 M. The reaction mix was 

3 0 incubated for 1.5 hours at room temperature and then over 

the weekend at 4*C. Following conjugation, unreacted M2IT 
linkers were quenched with 1:1 mole cysteamine to linker 
for 15 minutes at room temperature. The quenched reaction 
solution was loaded onto a gel filtration column (AcA54) 

35 equilibrated with 10 mM Tris, 250 mM NaCl, pH 7.5. The 
first peak off this colusm was diluted to 20 mM NaCl with 
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10 mM Tris, pH 7 and loaded on Blue Toyopearl® which was 
equilibrated in 10 mM Tris, 20 mM NaCl, pH ?• The column 
was then washed with 10 mM Tris, 30 mM Nacl, pH 7.5. The 
product was eluted with 10 mM Tris, 1 M NaCl, pH 7.5. 

5 (iii) he2::Fab_GelA50(CA4) 

The he2-Fab was dialyzed overnight into 25 mM 
TEOA, 0.25 M NaCl, pH 8 buffer and then concentrated to 
13.3 mg/mL prior to derivitization with the M2IT 
' crosslinker. For the linker reaction, M2IT was used in a 

10 20-fold molar excess in the presence of 2.5 mM DTNB. The 

reaction was allowed to proceed for 20 minutes at room 
temperature and was then desalted on a GF05-LS (gel 
filtration) column, equilibrated in 0*1 M Na phosphate, 0.2 
M NaCl with 0.02% Na azide. A linker number of 1*7 linkers 

15 per Fab-M2IT-TNB was calculated based on the DTNB assay. 

After derivitization and gel filtration, the he2-Fab 
concentration was 5.2 mg/mL. 

GelA50(C4o at 8.33 mg/mL in 10 mM Na phosphate, pH 
7.2 was treated with 5 mM DTT and 0.5 mM EDTA for 30 

20 minutes at room temperature to; expose a reactive thiol for 
conjugation and then was desalted on GF05-LS resin 
equilibrated in 0.1 M Na phosphate, 0.1 M NaCl with 0.5 mM 
EDTA plus 0.02% Na azide, pH 7. 5. Free thiol content was 
determined to be 0.83 moles of free SH per mole of Gel^jocc^*) 

25 using the DTNB assay. The gelonin was concentrated to 11.4 

mg/mL prior to conjugation with activated he2-Fab. 

The conjugation reaction conditions between the 
free thiol on GelA50(C4M derivitized he2-Fab-M2IT-TNB 

were as follows. A 3-fold excess of the gelonin analog was 

30 added to activated he2-Fab-M2IT-TNB (both proteins were in 
0.1 M Na phosphate, 0.1 M NaCl, pH 7.5 but the gelonin 
solution contained 0.5 mM EDTA as well) • The reaction 
mixture was concentrated to half its original volume, then 
the mixture was incubated for 4 hours at room temperature 

35 followed by 72 hours at 4*C. Following the incubation 
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period the efficiency of conjugation was estimated at 70- 
75% by examination of SDS PAGE* 

Following conjugation the excess M2IT linkers 
were quenched by incubation with 1:1 mole cysteamine to 
5 linker for 15 minutes at room temperature. The quenched 

reaction as loaded onto a gel filtration column (G-75) 
equilibrated in 10 mM Tris, 0.15 M NaCl, pH 1 . The first 
peak off this column was diluted to 30 mM NaCl with 10 mM 
Tris, pH 7 and loaded onto a Blue Toyopearl® (TosoHaas) 
10 'column. The product was eluted with 10 mM Tris, 1 M NaCl, 
pH 7.5. 

(iv) he3rFab_Gelj,50(c«M 

Similarly, the he3-Fab was dialyzed overnight 
into 25 mM TEOA, 0.25 M NaCl, pH 8 buffer and then 

15 concentrated to 5 mg/mL prior to derivitization with the 

M2IT crosslinker. For the linker reaction, M2IT was used 
in a 10-fold molar excess in the presence of 2.5 mM DTNB. 
The reaction was allowed to proceed for 45 minutes at room 
temperature and was then desalted on a GF05-LS (gel 

20 filtration) column, equilibrated in 0.1 M Na phosphate, 0.2 

M NaCl with 0.02% Na azide. A linker number of 1 M2IT per 
Fab-M2IT-TNB was calculated based on the DTNB assay. After 
derivitization and gel filtration, the he3-Fab 
concentration was 5.3 mg/mL. 

25 GelA50(C44) at 7.8 mg/mL in 0.1 M Nai phosphate, 0.1 

M NaCl, pH 7.5 was treated with 1.5 mM DTT and 1 mM EDTA 
for 30 minutes at room temperature to expose a reactive 
thiol for conjugation and then was desalted on GF05-LS 
resin equilibrated in 0,1 M Na phosphate, 0.1 M NaCl plus 

3 0 0.02% Na azide, pH 7.5. Free thiol content was determined 

to be 0.66 moles of free SH per mole of GelA50(C44) using the 
DTNB assay. The gelonin was concentrated to 5.2 mg/mL 
prior to conjugation with activated he3-Fab. 

The conjugation reaction conditions between the 

35 free thiol on Gely^so^c**) aJ^d the derivitized he3-Fab-M2IT-TNB 

were as follows. A 5-fold excess of the gelonin analog was 
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added to activated he3-Fab-M2IT-TNB (both proteins were in 
0.1 M Na phosphate 0.1 M NaCl, pH 7.5). The reaction 
mixture was incubated for 2 hours at room temperature 
followed by 72 hours at 4*C. Following the incubated 
period the efficiency of conjugation was estimated at 70- 
75% by examination of SDS PAGE. 

Following conjugation^ the excess M2IT linkers 
were quenched by incubation with 1:1 mole cysteamine to 
linker for 15 minutes at room temperature. The quenched 
reaction was loaded onto a GammaBind G {immobilized protein 
G affinity resin, obtained from Genex;, Gaithersburg, 
Maryland) equilibrated in 10 mM Na phosphate, 0.15 M NaCl, 
pH 7. It was eluted with 0.5 M NaOAc, pH 3 and neutralized 
with Tris. It was dialyzed into 10 mM Tris, 0.15 M NaCl, 
pH 7 overnight, then diluted to 30 inM NaCl with 10 mM Tris, 
pH 7 and loaded onto a blue Toyopearl® (TosoHaas) column. 
The product was eluted with 10 mM Tris^ 1 M NaCl, pH 7.5 

Example a 

Whole Cell Kill Assays 
20 Immunoconjugates prepared with gelonin and 

gelonin analogs were tested for cytotoxicity against an 
acute lymphoblastoid leukemia T. cell line {HSB2 cells) and 
against human peripheral blood mononuclear cells (PBMCs) . 
Immunoconjugates of ricin A-chain with H65 antibody (H65- 
25 RTA) and antibody fragments were also tested. The ricin A- 
chain (RTA) as well as the H65-RTA immunoconjugates were 
prepared and purified according to methods described in 
U.S. Patent Application Serial No. 07/306,433, supra and in 
International Publication No. WO 89/06968, supra. 
30 Briefly, HSB2 cells were incubated with 

immunotoxin and the inhibition of protein synthesis in the 
presence of immunotoxin was measured relative to untreated 
control cells. The standard immunoconjugates H65-RTA (H65 
derivitized with SPDP linked to RTA) , H65-Gelonin and H65- 
35 rGelonin, H65 fragment ijoamunoc on jugate, and gelonin 
immunocon jugate samples were diluted with RPMI without 
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leucine at half-log concentrations ranging from 2000 to 
0.632 ng/ml. All dilutions were added in triplicate to 
wells of microtiter plates containing 1 x 10^ HSB2 cells per 
well. HSB2 plates were incubated for 20 hours at 37 *C and 
5 then pulsed with ^H-Leu for 4 hours before harvesting. 
Samples were counted on the Inotec Trace 96 cascade 
ionization counter. By comparison with an untreated 
sample, the picomolar concentration (pM) of immunotoxin 
which resulted in a 50% inhibition of protein synthesis 

10 (IC50) was calculated. In order to normalize for conjugates 

containing differing amounts of toxin or toxin analog, the 
cytotoxicity data were converted to picomolar toxin (pM T) 
by multiplying the conjugate IC50 (in pM) by the 
toxin/antibody ratio which is unique to each conjugate 

15 preparation. 

The PMBC assays were performed as described by 
Fishwild et al., Clin, and Exp. Jmmunol., 86:506-513 (1991) 
and involved the incubation of immunoconjugates with PBMCs 
for a total of 90 hours. During the final 16 hours of 

2 0 incubation, ^H-thymidine was added; upon completion, 

immunoconjugate-induced inhibition of DNA synthesis was 
quantified; • The activities of the H65 and chimeric H65 
antibody conjugates against HSB2 cells and PBMC cells are 
listed in Table 2 below. 



yVO 94/26910 



PCTAJS94/05348 



-73-- 

Table 2 

IC50 (PM T) . 

Conjugate HS62 Cells PBMCs 

H65-RTA 143 459 

5 H65-(M2IT)-S-S-(M2IT)-Gelonin 1770 81 

H65- (M2IT) -S-S- (M2IT) -rGelonin 276 75 

H65-(M2IT)-S-S-Gelcio 14 0 28 
H€5-(M2IT)-S-S-GelA50(C44) 51 

H65-(M2IT)-S-S-Gelc239 2328 180 

10 ' H65-(M2IT)-S-S-Gelc2(i* >5000 >2700 

H65- (M2IT) -S-S-Gelc2*7 41 35 

H65-(M2IT)-S-'S-Gelc2A8 440 203 

CH65-RTA30 60 400 

cH65- (M2IT) -S-S- {M2IT) -Gelonin 1770 140 

15 cH65-(M2IT)-S-S-(M2IT)-rGelonin 153 120 

cH65-(M2IT)-S-S-Gelc239 >7000 290 

CH65-'(M2IT)-S-S-Gelc2*7 ^4 60 

cH65-(M2IT)-S-S-Gelc24B 238 860 
H65-(M2IT)-S-S-GelA44(c50) ^38 ND* 

20 H65- (M2IT) -S-S-Gelc2«7A*»4A50 71 ND* 

* — Not determined. 



25 



30 



35 



Against HSB2 cells, many of the gelonin analog 
immunoconjugates were significantly more potent than 
conjugates prepared with native gelonin or recombinant, 
unmodified gelonin, both in terms of a low IC50 value, but 
also in terms of a greater extent of cell kill. Against 
human PBMCs, the gelonin analog conjugates were at least as 
active as native and recombinant gelonin conjugates. 
Importantly, however, some of the conjugates (for example, 
Geldor GelA50(c«4) and Gelc247) exhibited an enhanced potency 
against PBMCs compared to native and recombinant gelonin 
conjugates, and also exhibited an enhanced level of cell 
kill. 

The activities of the H65 antibody fragment 
conjugates against HSB2 cells and PBMC cells are listed in 
Tables 3 and 4 below, wherein extent of kill in Table 3 
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refers to the percentage of protein synthesis inhibited in 
HSB2 cells at the highest inanunotoxin concentration tested 

(1 fig/ial)- - •. 



10 





Table 3 






ICjo (PM T). 




Coniuqate 


HSB2 Cells 


PBMCs 


cH65Fab'-RTA 30 


530 


1800 


cH65Fab • -rGelonin 


135 


160 


cH65Fab'-Gelc2»7 


48 


64 


cH65F(ab" )2-RTA 30 


33 


57 


CH65F ( ab • ) j-rGelonin 


55 


34. 


cH65F(ab')2-Gelc247 


23 


20 


cH65F(ab')2-Gelc2«e 


181 


95 



15 



20 



Coniuaatie 

hel Fab'-Gelc247 

hel Fab-Gelc2«7 

he2-Fab-GelA50(C4O 

he3-Fab-GelA5occ4*) 
cH65Fab'-Gelc247 

CH65F { ab • ) j-rGelonin 

cH65F(ab«)2-Gelc247 

cH65F{ab»)2-Gelc247 



Table 4 

IC50 (pM T) 

HSB2 Cells 
57.7 

180.0 

363.0 
191.0 

47.5 

45.4 . 

77.5 

23.2 



Extent: of Kill 
93% 

94% 

91% 
93% 
93% 
85% 
83% 
85% 



25 



The data in Table 3 show that monovalent (Fab or 
Fab') fragments conjugated to various forms of gelonin are 
more potent than RTA conjugates. Table 4 shows that the 
human-engineered gelonin-Fab conjugates exhibit a very high 
degree of extent of kill. 
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Exaunple 9 

Properties Of Gelonin Iimunoconiugates 

A. Solubility 

Recombinant gelonin and the gelonin analogs 
5 exhibited enhanced solubility in comparison to both native 

gelonin and RTA30* In addition, recombinant gelonin and 
gelonin analog immunoconjugates exhibited enhanced 
solubility relative to immunoconjugates prepared with 
native gelonin and RTA30. This enhanced solubility was 
10 ' particularly noteworthy for recombinant gelonin and analog 
conjugates prepared with chimeric Fab fragments. 

B . Disulfide Bond Stability Assay 

The stability of the disulfide bond linking a RIP 
to a targeting molecule (such as an antibody) is known to 

15 influence the lifiespan of immunoconjugates in vivo [See 

Thorpe et al.. Cancer Pes., 47:5924-5931 (1987), 
incorporated by reference herein]. For example/ conjugates 
in which the disulfide bond is easily broken by reduction 
in vitro are less stable and less efficacious in animal 

20 models [-See Thorpe et al.. Cancer J?es., 48:6396-6403 

(1988), incorporated by reference herein]. 

Immunoconjugates prepared with native gelonin, 
recombinant gelonin and gelonin analogs were therefore 
examined in an in vitro disulfide bond stability assay 

25 similar to that described in Wawrzynczak et al.. Cancer 

Res. J. 50:7519-7526 (1990), incorporated' by reference 
herein. Conjugates were incubated with increasing 
concentrations of glutathione for 1 hour at 37 'C and, after 
terminating the reaction with iodoacetamide, the amount of 

30 RIP released was quantitated by size^exclusion HPLC on a 

TosoHaas TSK-G2000SW column. 

By comparison with the amount of RIP released by 
high concentrations of 2-mercaptoethanol (to determine 100% 
release) , the concentration of glutathione required to 

35 release 50% of the RIP (the RC50) was calculated. The 
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results of assays for H65 antibody conjugates are set out 
in Table 5 below. 

Table 5 



Conjugate RCg ^ finM) 

5 H65-RTA 30 3.2 

H65-(M2iT)-S-S-(M2IT)-gelonin 11.1 

H65- (M2IT) -S-S- (M2IT) -rGelonin 3 . 0 

H65-(M2IT)-S-S-Gelcio 2.5 

H65-(M2IT)-S-S-Gel;ijo,c*«, 0.6 

10 H65-(M2IT)-S-S-Gelc239 774.0 

H65-(M2IT)-S-S-Gelc2«« 1.-2 

H65-(M2IT)-S-S-Gelc2*7 0.1 

H65-(M2IT)-S-S-Gelc248 0.4 

CH65-RTA 3 0 2-50 

15 CH65-(M2IT) -S-S- (M2IT) -rGelonin 2.39 

CH65- (M2IT) -S-S-Gelc2*7 0.11 

CH65-(M2IT)-S-S-Gelc248 0.32 

H65-(M2IT)-S-S-GelA„,c50) 9.2 

H65- (M2IT) -S-S-Gelc2*7A»«A50 0 .3 



20 The foregoing results indicate that the stability of the 
bonds between the different gelonin proteins and H65 
antibody varied greatly. With the exception of Geldo and 
Gelc239> most of the gelonin analogs resulted in conjugates 
with linkages that were somewhat less stable in the in 

25 vitro assay than the dual-linker chemical conjugate. The 
stability of the Gelc239 analog, however, was particularly 
enhanced. 

The results of the assay for H65 antibody 
fragment conjugates are set out in Table 6 below. 
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Table 6 

Conjugate RC c n (tM\ 

hel Fab'-Gelc247 0.07 

cFab'-'Gelonin 1.27 

5 cFab»-Gelc247 0.08 

cF(ab*)2"I^TA 30 1.74 

cF (ab' ) 2-rGelonin 2.30 

cF(ab» )2^Gelc2o 0.09 

cF(ab*)2-"Gelc248 0.32 

m 

10 he2-Fab-GelA50(C44) 0.4 6 

he3-Fab-GelA50(c*4) 0.58 

From the RC50 results presented in Tables 5 and 6, 
it appears that the particular RIP analog component of each 
immunotoxin dictates the stability of the immunotoxin 
15 disulfide bond in vitro* 

Example 10 

Pharmacokinetics Of Conjugates To H65 Antibody 

The pharmacokinetics of gelonin analogs Gelcj*?, 
GelA50(C44i)r and Geldo linked to whole H65 antibody was 

20 investigated in rats. An IV bolus of 0.1 mg/kg of ^^^I- 

labelled immunocon jugate H65- (M2IT) -S-S-Gelc2A7# H65--{M2IT)- 
S-S^GelA50(C4A) or H65-(M21T)"S-S-Gelcio was administered to 
male Sprague-Dawley rats weighing 134-148 grams. Serum 
samples were collected from the rats at 3, 15, 30 and 45 

25 minutes, and at 1.5, 2, 4, 6, 8, 18, 24, 48, 72, and 96 
hours. Radioactivity (cpm/ml) of each sample was measured, 
and SDS-PAGE was performed to determine the fraction of 
radioactivity associated with whole immunocon jugate. 
Immunoconjugate-associated servim radioactivity was analyzed 

30 using the computer program PCNONLIN (SCI Software, 
Lexington, Kentucky) . Table 7 below lists the 

pharmacokinetic parameters of the immunocon jugates. In 
that table, the standard error for each value is indicated 
and a one way analysis of variance is presented, IC is the 

35 immunocon jugate (specified by the abbreviation for the 
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gelonin variant that is part of the immunoconjugate) , n is 
the number of animals in the study, Vc is the central 
volume of distribution, Cl is the clearance, MRT is the 
total body mean residence time. Alpha is the a half --life 
5 and Beta is the jS half -life of the immunoconjugate. 
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The Gelc247 imnunoconjugate was found to have a and 
P half lives of 2.3 and 20 hours, with a total mean 
residence time of 17 hours. The 7 2 and 9 6 hour time points 
were excluded from analysis because of the poor resolution 
of immunoconjugate associated radioactivity on the SDS-PAGE 
gel for these serum samples. 

Because in vitro studies suggested that the Gelcio 
immunoconjugate had greater disulfide bond stabilit)^ it 
was anticipated that its half lives in vivo would be longer 
relative to the cyszi,^ form of the immunoconjugate. The /S 
half life of the immunoconjugate was about 33 hours 
compared to 20 hours for the Gelc247 conjugate. The total 
mean residence time was also much greater for the Gel^.n 
immunoconjugate (42 hours versus 42 hours for the Gelj^y 
conjugate) . In addition, the clearance of the Geldo 
immunoconjugate was 2.5 ml/hr/kg, about four times less 
than that of the Gelc2o immunoconjugate (ll ml/hr/kg). As 
also predicted from the in vitro disulfide stability data, 
the clearance of the Gel^sotc^A) immunoconjugate was 
intermediate between those of the Geldo and Gelc247 
immunoconjugates. 

Based on these studies, the Geldo analog 
conjugated to H65 antibody has greater in vivo stability 
than the GelA5o<c44) Gelc247 analogs conjugated to H65 

antibody (as determined by the longer mean residence time 
and clearance rates) , although the properties of the 
G^1a50(C44) immunoconjugate more closely resembled those of the 
Gelcio immunoconjugate than the Gelc247 immunoconjugate. 

Example 11 

Pharmacokinetics Of Conjugates To H65 Antibody Frag ingn-hg 

The pharmacokinetics of Gelc247 and GelA50(c*«) 
analogs linked to human engineered H65 Fab fragments were 
also investigated in rats. An IV bolus of 0.1 mg/kg of ^"l- 
labelled hel H65 Fab-Gelc247/ he2 H65 Fab-GelA50(C4A) or he3 H65 
Fab-GelA50(c*A) was administered to male Sprague-Dawley rats 
weighing 150-180 grams. Serum samples were collected at 3, 
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5, 15, 20, 30, and 40 minutes, and 1, 1.5, 3, 6, 8, 18, 24, 
32, 48, and 72 hours, and were analyzed by ELISA using 
rabbit anti-Gelonin antibody as the capture antibody and 
biot in-labelled goat anti-human kappa light chain antibody 
5 as the secondary antibody. Results of the analysis are 

presented in Table 8 below. In the table, the standard 
error for each value is shown, and IC is the 
inanunocon jugate, n is the number of animals in the study, 
Vc is the central volume of distribution, Vss is the steady 
10 state voliune of distribution, Cl is the clearance, MRT is 

the total body mean residence time. Alpha is the a half- 
life and Beta is the /S half-life of the indicated 
conjugate. 
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Comparing the three immunoconjugates , the 
pharmacokinetics of hel H65 Fab-Gelc2A7i he2 H65 Fab-GelA50(C4M 
and he3-Fab-GelA50(c*4) were very similar, having similar alpha 
and beta half-lives^ mean residence times, and clearance, 
particularly when; comparing parameters obtained from the 
ELISA assayed curves. This is in contrast to their whole 
antibody immunocon jugate counterparts, where the clearance 
of Gelc2A7 immunocon jugate (11 ml/kg/hr) was three-fold 
greater than that of Gel^s^^ci,*) immunocon jugate (4 ml/kg/hr) . 
This suggests that cleavage of the disulfide bond linking 
the Fab fragment and gelonin is not as important for the 
serum clearance of Fab immunoconjugates as for whole 
antibody immunocon jugates. 

Example 12 

15 Immunoqenicitv Of Immunoconiuqates 

Outbred Swiss/ Webster mice were injected 
repeatedly (0.2 mg/kg each injection) with murine H65 
antibody conjugates prepared with RTA, RTA30 and 
recombinant gelonin. The cycle was such that each animal 

20 was injected on days 1 and 2, and then the injections were 

repeated 28 and 29 days later. The animals received 5 such 
cycles of injections. One week and three weeks following 
each series of injections, blood was collected and the 
amount of anti-RIP antibodies present was determined by 

25 ELISA; peak titers for each cycle are shown in Table 9. 

RTA and RTA30 generated strong responses which began 
immediately following the first cycle of injections and 
remained high throughout the experiment. In contrast, no 
immune response was detected for the gelonin conjugate, 

30 even after 5 cycles of injections. When the conjugates 

were mixed with Complete Freund Adjuvant and injected i.p. 
into mice, anti-RTA and RTA-30 antibodies were readily 
detected after several weeks. These data indicate that 
anti-gelonin antibodies, if generated, would have been 

35 detected by the ELISA assay, and suggest that recombinant 
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gelonin may 


be much less 


immunogenic in 


animals than is 


RTA. 




• 








Table 9 




Cycle 


H65-RTA 


H65-RTA3 0 


H65-rGel 


Prebleed 


100 


100 


100 


Cycle 1 


168 


117 


100 


Cycle 2 


4208 


1008 


100 


^Cycle 3 


7468 


3586 


100 


Cycle 4 


5707 


3936 


100 


Cycle 5 


4042 


2505 


100 



Example 13 

In vivo Efficacy Of Immunoconiuoates 

A human peripheral blood lymphocyte (PBL)- 
reconstituted, severe combined immunodef icient mouse model 
15 was utilized to evaluate the in vivo efficacy of various 

immunoconjugates comprising the gelonin analogs Gelca^y and 
GelA50(c**)* Immunoconjugates were tested for the capacity to 
deplete human blood cells expressing the CDS antigen. 

A. Human PBL Donors And Cell Isolation 

20 Human peripheral blood cells were obtained from 

lymphapheresis samples (HemaCare Corporation, Sherman Oaks, 
CA) or venous blood samples (Stanford University Blood 
Bank, Palo Alto, CA) collected from healthy donors. Blood 
cells were enriched for PBLs using Ficoll-Hypaque density 

25 gradient centrif ugation (Ficoll-Paque®; Pharmacia, 

Piscataway, New Jersey) and subsequently washed 4 times 
with PBS. Residual erythrocytes were lysed with RBC lysing 
buffer (16 MM ammonium chloride, 1 mM potassium 
bicarbonate, 12.5 EDTA) during the second wash. Cell 

30 viability in the final suspension was >95% as assessed by 

trypan blue dye exclusion. 
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B. Animals And Human PBL Transfer 

CB»17 scid/scid (SCID) mice were purchased from 
Taconic (Germantovn, New York) or were bred under sterile 
conditions in a specific pathogen-free animal facility 
5 (original breeding pairs were obtained from Hana Biologies, 

Alameda, California). Animals were housed in filter-top 
cages and were not administered prophylactic antibiotic 
treatment. Cages , bedding, food and water were autoclaved 
before use. All manipulations with animals were performed 

10 - in a laminar flow hood. 

Untreated SCID mice were bled for determination 
of mouse Ig levels. Human PBL- injected mice were bled at 
various intervals for quantitation of *human Ig and sIL-2R. 
Blood collection was from the retro-orbital sinus into 

15 heparinized tubes. Blood samples were centrifuged at 300 

X g for 10 min, and plasma was collected and stored at 
-70 'C. Mouse and human Ig were quantified using standard 
sandwich ELISAs. Briefly, flat-bottom microtiter plates 
(MaxiSorp Immuno-Plates, Nunc, Roskilde, Denmark) were 

20 coated overnight at 4*C with goat anti-mouse IgG+IgA+IgM 

(Zymed Laboratories, Inc., South San Francisco, California) 
or goat anti-human Igs (Tago, Inc., Burl ingame, Calif ornia) 
in bicarbonate buffer, pH 9.6. Plates were blocked for 2 
hours at room temperature with 1% BSA in Tris-buf f ered 

25 saline, pH 7.5 (TBS), and then incubated at 37 *C for 1 hour 

with standards or samples serially-diluted in TBS/1% 
BSA/0.05% Tween 20. Standards used werei a monoclonal mouse 
IgG2a (INDl anti-melanoma; XOMA Corporation, Berkeley, 
California) and polyclonal human Ig (Sigma Chemical Co., 

30 St. Louis, Missouri). Subsequently, plates were washed 
with TBS/Tween 20 and incubated at 37 'C for 1 hour with 
alkaline phosphatase-conjugated goat anti-mouse IgG+IgA-fIgM 
or goat anti-human Igs (Caltag Laboratories, South San 
Francisco, California) . Detection was by measurement of 

35 absorbance at 405 nm following incubation with 1 mg/ml p- 

nitro-phenylphosphate (Sigma) in 10% diethanolamine buffer, 
pH 9.8. Plasma from a normal BALB/c mouse was used as a 
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positive control in the mouse Ig ELISA. Plasma samples 
from naive SCID mice or normal BALB/c mice did not have 
detectable levels of human Ig» Human sIL-2R was quantified 
using an ELISA kit (Immunotech S.A., Marseille, France) as 
5 per the manufacturer's instructions. 

Five-to-seven week old mice with low plasma 
levels of mouse Ig (<10/ig/ml) were preconditioned with an 
i.p. injection of cyclophosphamide (Sigma) at 2 00 mg/kg. 
Two days later, they were injected i.p. with 25-40 x 10* 
10 freshly-isolated human PBL suspended in 0.8 ml PBS. 

C. Immunoconiugate Treatment 

SCID mice were bled at approximately 2 weeks 
after himan PBL transplantation. Mice with undetectable 
(<10 pM) or low plasma levels of human sIL-2R were 

15 eliminated from the study. The cut-off for exclusion of 

mice with detectable, but low, levels of hvunan sIL-2R was 
empirically determined for each study and was generally 20 
pM. The remaining mice were divided into groups and were 
administered vehicle or immunoconjugate as an i.v. bolus 

2 0 (0.2 mg/kg) daily for 5 consecutive days. Animals were 

sacrificed 1 day after cessation of treatment for 
quantitation of human T cells in tissues and hvuaan sIL-2R 
in plasma. 

D. Collection Of Tissues And Analysis Of PBL Depletion 
25 Blood was collected from the retro-orbital sinus 

into heparinized tubes. Mice were then killed by cervical 
dislocation and spleens were removed aseptically. Single 
cell suspensions of splenocytes were prepared in HBSS by 
pressing the spleens between the frosted ends of sterile 
30 glass microscope slides. Collected cells were washed twice 
with PBS. Erythrocytes were eliminated from blood and 
splenocyte suspensions using RBC lysing buffer. 
Subsequently, cells were resuspended in PBS for 
enumeration. Recovered cells were then assayed for Ag 
5 expression using flow cytometry. 
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Two to five hundred thousand cells in 100 /il of 
PBS/1% BSA/0.1% sodium azide were incubated on ice for 30 
min. with saturating amounts of various FITC- or 
phycoerythrin (PE) -conjugated Abs (Becton-Dickinson, 
5 Mountain View, CA) Abs used for staining included: HLe-i- 
FITC (IgGl anti-CD45) , Leu 2-FITC (IgGl anti-CD8) , Leu 3 PE 
(IgGl anti-CD4), and Leu M3-PE (IgG2a anti-CD14). Cells 
were then washed in cold buffer and fixed in 0*37% 
formaldehyde in PBS. Samples were analyzed on a FACscan 

10 • (Becton^Dickinson) using log amplifiers. Regions to 
quantify positive cells were set based on staining of cells 
obtained from naive SCID mice. The absolute numbers of 
human Ag-positive cells recovered from SCID tissues were 
determined by multiplying the percent positive cells by the 

15 total number of cells recovered from each tissue sample. 

The total number of leukocytes in blood was calculaited 
using a theoretical blood volume of 1.4 ml/mouse. The 
detection limit for accurate quantitation of human cells in 
SCID mouse tissues was 0,05%. All statistical comparison 

20 between treatment groups were made using the Mann-Whitney 

U test. Treatment groups were determined to be 
significantly different from buffer control groups when the 
p value was <0.05. Results are presented in Table 10 
below, wherein + indicates a significant difference from 

25 controls, - indicates an insignificant difference and NT 

means the conjugate was not tested ♦ CDS Plus (XOMA 
Corporation, Berkeley, California) is mouse H65 antibody 
chemically linked to RTA and is a positive control. 0X19 
Fab-Gelc2A7 is a negative control immunocon jugate. The 0X19 

30 antibody (European Collection of Animal Cell Cultures 

#84112012) is a mouse anti-rat CDS antibody that does not 
cross react with human CDS. 
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Table 10 

Test Article Human T Cell Depletioh 

— Spleen Blood 
CD5 Plus + + 

5 cH65 F(ab')2 

cH65 Fab' 

H65-rGEL + + 

CH65 F(ab')2-rGel + + 

cH65 Fab'-rGel + + 

10 cH65 F(ab*)2-Gelc247 + NT 

cH65 Fab'^Gel^2<i7 + + 

helH65 Fab»-Gelc2*7 + ^ NT 

cH65 Fab»-GelA5o<C4A) + + 
0X19 Fab-Gele2o 

15 All the gelonin immunoconjugates were capable of depleting 

huanan cells in the SCID mouse model. 

Example 14 

Construction Of Gelonin 

Immunofusions With Chimeric Antibodies 

20 Several genetic constructs were assembled which 

included a natural sequence gelonin gene fused to an H65 
truncated heavy chain gene {Fd or Fd'), or an H65 light 
chain gene (kappa). In this Example, H65 Fd, Fd', and H65 
light chain refer to chimeric constructs. The H65 Fd 

25 sequence consists of the nucleotides encoding the murine 

H65 heavy chain variable (V) , joining (J) and himan IgGi, 
constant (C) domain 1 regions, including the cysteine bound 
to light chain IgG^ and has the carboxyl terminal sequence 
SCDKTHT (SEQ ID NO: 13 0) . The H65 Fd' sequence has the H65 

30 Fd sequence with the addition of the residues CPP from the 
hinge region of human IgG. heavy chain, including a cysteine 
residue which is bound to the other human IgGj heavy chain 
and its F(ab')2 fragment. See Better, et al., Proc. Nat. 
Acad. Sci. (USA) ^ 90: 457-461 (1993), incorporated by 

35 reference herein. 
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The H65 light chain sequence consists of the 
nucleotides encoding the murine H65 light chain variable 
(V) , joining (J) , and human kappa (C^) regions. The DNA 
sequences of the V and J regions of the H65 Fd and kappa 
5 fragment genes linked to the pelB leader can be obtained 
from GenBank (Los Alamos National Laboratories^ Los Alamos, 
New Mexico) under Accession Nos. M9 04 68 and M9 04 67, 
respectively. Several of the gene fusions included a 
gelonin gene linked at the 5* end of an H65 Fab fragment 

10 * gene while the others included a gelonin gene linked at the 
3« end of an H65 Fab fragment gene, A DNA linker encoding 
a peptide segment of the E. coll shiga-like toxin (SLT) 
(SEQ ID NO: 56), which contains twb cysteine residues 
participating in a disulfide bond and forming a loop that 

15 includes a protease sensitive amino acid sequence) or of 

rabbit muscle aldolase [ (RMA) as in SEQ ID NO: 57, which 
contains several potential cathepsin cleavage sites] was 
inserted between the gelonin gene and the antibody gene in 
the constructs. Alternatively, a direct fusion was made 

20 between a gelonin gene and an H65 Fab fragment gene without 

a peptide linker segment. Table 11 below sets out a 
descriptive name of each gene fusion and indicates the 
expression plasmid containing the gene fusion and the 
section of the application in which each is diesignated. 

25 Each plasmid also includes the Fab fragment gene (shown in 
parentheses in Table 11) with which each particular gene 
fusion was co-expressed. The inclusion of a cysteine from 
a hinge region (Fd') allows potential formation of either 
monovalent Fab' or bivalent F(ab')2 forms of the expression 

30 product of the gene fusion. 
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Table 11 



Section 


Plasmid 


Description 


~B{4=-) 


-pING3-X54 


Gelon-in SLT-: : Pd/ (kappa) 


B(ii) 


PING3757 


Gelonin: : SLT: : kappa (Fd' ) 


5 B(iii) 


pING3759 


Gelonin: :RMA: :Fd' (kappa) 


B(iv) 


pING3758 


Gelonin : : RMA : : kappa ( Fd ' ) 


A(i) 


PING4 4 06 


Fdi :SLT: : Gelonin (kappa) 


A(ii) 


pING44 07 


kappa: :SLT: : Gelonin (Fd) 


A(iii) 


PING44 08 


Fd: :RMA: : Gelonin (kappa) 


10 A(iv) 


PING4410 


kappa: :RMA: : Gelonin (Fd) 


C(i) 


PING3334 


Gelonin: :Fd (kappa) 



A. Fusions Of Gelonin At The 

Carboyyl-Tenninus Of Antibody Genes 

(i) Fd; :SLT: ; Gelonin f kappa ^ 

15 A gelonin gene fusion to the 3 •-end of the H65 Fd 

chain with the 23 amino acid SLT linker sequence was 

assembled in a three piece ligation from plasxnids pVKl, 

PING3731 (ATCC 68721) and pING4 000. Plasmid pVKl contains 

the Fd gene linked in-rframe to the SLT linker sequence and 

2 0 some H65 Fd' and kappa gene modules as in pING3217, shown 

in Better^ et aJ.^ ProcNat. Acad. Sci . (USA) t 457-461 
(1993)^ except that the kappa and Fd' regions are reversed* 
Plasmid pING3731 contains the gelonin gene, and pING4000 
contains the H65 kappa and Fd' genes each linked to the 
25 pelB leader sequence under the control of the araB promoter 
as a dicistronic message. 

Plasmid pVKl was designed to link the 3'-end of 
a human IgG Fd constant region in-frame to a prot ease- 
sensitive segment of the SLT gene bounded by two cysteine 

3 0 residues which form an intra-chain disulfide bond. The SLT 

gene segment (20 amino acids from SLT bounded by cysteine 
residues, plus three amino acids, introduced to facilitate 
cloning) was assembled from two oligonucleotides, SLT 
Linker 1 and SLT Linker 2. 
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SLT Liliker 1 (SEQ ID NO: 73) 

5 » TGTGATCATCATGCATCGCGAGTTGCCAGAATGGCATCT 

GATGAGTTTCCTTCTATGTGCGCAAG^^^ 3» 

SLT Linker 2 (SEQ ID NO: 74) 
5 5 " TCGAGAGTACTTGCGCACATAGAAGGAAACTCATCAGAT 

GCCATTCTGGCAACTCGCGATGCATGATGATGACATGCA 3 • 
The two oligonucleotides were annealed and ligated into a 
vector (pING3l85) containing PstI and Xhol cohesive ends, 
destroying the PstI site and maintaining the Xhol site. 
10 Plasmid pING3185 contained an engineered PstI site at the 

3»-end of the Fd gene, and contained an Xhol site 
downstream of the Fd gene* The product of this ligation, 
pVKl, contained the H65 Fd gene (fused" to the pelB leader) 
in frame with the SLT linker segment, and contained two 
15 restriction sites, Pspl and Seal, at the 3 '-end of the SLT 

linker. 

Plasmid pVKl was digested with 5aul and Seal, and 
the 217 bp fragment containing a portion of the Fd constant 
domain and the entire SLT gene segment was purified by 

20 electrophoresis on an agarose gel. pING3731 was digested 

with Sjnal and Xhol and the 760 bp gelonin gene was 
similarly purified. Plasmid pING4 000 was digested with 
Saul and Xhol and the vector segment containing the entire 
kappa gene and a portion of the Fd gene was also purified. 

25 Ligation of these three DNA fragments resulted in pING44 06 
containing the Fd: : SLT: : Gelonin (kappa) gene fusion vector. 



(ii) kappa: :SLT: : Gelonin (Fd^ 

A gelonin gene fusion to the 3'-end of the H65 
kappa chain with the 25 amino acid SLT linker sequence (20 
30 amino acids from SLT bounded by cysteine residues, plus 5 
amino acids introduced to facilitate cloning) was assembled 
from the DNA segments in pING3731 (ATCC 68721) and 
pING3713. 

Plasmid pING3713 is an Fab expression vector 
35 where the H65 Fd and kappa genes are linked in a 
dicistronic transcription unit containing the SLT linker 
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segment cloned in-frame at the 3 '-end of the kappa gene. 
The plasmid was constructed as follows. In a source 
plasmid containing the H€5 Fd and kappa genes, an Eagl site 
was positioned at the 3 •-end of the kappa gene by site 
5 directed mutagenesis without altering the encoded amino 

acid sequence. The SLT gene segment from pVKl was 
amplified with primers and Sail for in frame 

linkage to the JE^agl site at the 3*-end of the kappa gene. 
SLT-i:ag-5* (SEQ ID NO: 75) 
10 ^ 5» TGTTCGGCCGCATGTCATCATCATGCATCG 3» 

Sail (SEQ ID NO: 76) 
5» AGTCATGCCCCGCGC 3» 
The 14 0 bp PGR product was digested with JPagl and Xhol, and 
the 75 bp fragment containing the SLT gene segmen't was 
15 cloned adjacent to the Fd and kappa genes in the source 

plasmid to generate pING3713. 

For construction of gene fusion to gelonin, 
PING3713 was cut with Seal and XhoX, and the vector 
fragment containing the Fd gene and kappa:: SLT fusion was 
20 purified. pING3731 was digested with Smal and Xhol and the 

DNA fragment containing the gelonin gene was also purified. 
The product of the ligation of these two fragments, 
PING4 407, contains the Fd and kappa: : SLT: : gelonin genes. 

(iii) Fd; :RMA: : Gelonin fkappa^ 

25 A gelonin gene fusion to the 3 •-end of the H65 Fd 

chain with the 21 amino acid RMA linker sequence (20 amino 
acids from RMA, plus 1 amino acid introduced to facilitate 
cloning) was assembled in a three piece ligation from 
plasmids pSH4, pING3731 (ATCC 68721) and pING4000. 

3 0 Plasmid pSH4 contains an Fd gene linked in frame 

to the RMA linker sequence. The RMA gene segment was 
linked to the 3»-end of Fd by overlap extension PGR as 
follows. The 3 '-end (constant region) of the Fd gene was 
amplified by PGR from a source plasmid with the primers 

35 KBA-72 and RMAG-1. Any Fd constant region may be used 
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because constant regions of all human IgGj antibodies are 
identical in this region. 



KBA-72 (SEQ ID NO: 77) 
5' TCCCGGCTGTCCTACAGT 3* 
5 R14AG--1 (SEQ ID NO: 78) 

5 ' TCCAGCCTGTCCAGATGGTGTGTGAGTTTTGTCACAA 3 » 
The product of this reaction was mixed with primer RMA-76, 
which annealed to the amplified product of the first 
reaction, and the mixture was amplified with primers KBA-72 
10 and RMAK-2* 

RMA-76 (SEQ ID NO: 79) 

5 • CTAACTCGAGAGTACTGTATGCATGGTTCGAGATGAACA 

AAGATTCTGAGGCTGCAGCTCCAGCCTG'^CCAGATGG 3 • 

RMAK-2 (SEQ ID NO: 80) 
15 5' CTAACTCGAGAGTACTGTAT 3* 

The PCR product contained a portion of the Fd constant 
region linked in-frame to the RMA gene segment. The 
product also contained a 5cal restriction site useful for 
in-frame fusion to a protein such as gelonin, and an Xhol 
20 site for subsequent cloning. 

This PCR product was cut with Saul and Xhol and 
ligated adjacent to the remainder of the Fd gene to 
generate pSH4. 

For assembly of the gene fusion vector containing 
25 the Fd: :RHA: :Gelonin, kappa genes, pSK4 was cut with Saul 

and Seal and the Fd::RMA segment was purified. Plasmid 
PING3731 was cut with Smal and Jfhol and the 760 bp DNA 
fragment containing the gelonin gene was purified, and 
PING4 000 was cut with Saul and Xhol and the vector was 
30 purified. The product of the ligation of these fragments, 
PING4408, contained the Fd: : RMA: : Gelonin and kappa genes. 

(iv) kappa; ;RMA: :Gelonin fFd^ 

A gelonin gene fusion to the 3 '-end of the H65 
kappa chain with the 21 amino acid RMA linker sequence was 
35 assembled in a three piece ligation from plasmids pSH6, 
pING4408 (see the foregoing paragraph) and pING3713. 



wo 94/26910 



PCTA3S94/05348 



-94- 

Plasmid pSH6 contains a kappa gene linked in- 
frame to the RMA linker: sequence. The RMA gene segment was 
linked to the 3«-end of kappa by overlap extension PGR as 
follows. The 3»-end (constant region) of the kappa gene 
5 was amplified by PGR from a source plasmid with the primers 
KBA-K2 and RMAK-1. 

» 

RMAK-1 (SEQ ID NO: 81) 

5» TCCAGCCTGTGCAGATGGACACTCTCCCCTGTTGAA 3* 
KBA-K2 (SEQ ID NO: 82) 

10 • 5« GTACAGTGGAAGGTGGAT 3» 

The product of this reaction was mixed with primer RMA-76 
(SEQ ID NO: 81), which annealed to the amplified product of 
the first reaction, and the mixture was amplified with 
primers KBA-K2 and RMAK-2. The PGR product contained a 

15 portion of the kappa constant region linked in-frame to the 
RMA gene segment. The product also contained a Seal 
restriction site useful for in-frame fusion to a protein 
such as gelonin, and an Xhol site for subsequent cloning. 
This PGR product was cut with 5stl and Xhol and ligated 

20 adjacent ,,Jto the remainder of the kappa gene to generate 
pSH6 . 

/ For assembly of the gene fusion vector containing 
the kappa: : RMA: cGelonin and Fd genes, pSH6 was cut with 
Hindi II and PstI and the DNA fragment containing the kappa 

25 constant region and a portion of the RMA linker (the PstI 

RMA linker segment contains a PstI site) segment was 
purified. Plasmid pING44 08 was cut with PstI and Sail and 
the DNA fragment containing a segment of the RMA linker, 
the gelonin gene and a portion of. the tetracycline 

30 resistance gene in the vector segment was purified. 

pING3713 was cut with Sail and Ifindlll and the vector was 
purified. The product of the ligation of these three 
fragments, pING4410, contained the kappa: : RMA: : Gelonin and 
Fd genes. 
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B. Fusions Of Gelonin At The 

Amino-Tenninus Of Antibody Genes 

(i) Gelonin; ;SLT; ;Fd^ fkappa) 

A gelonin gene fusion to the 5*-end of the H65 
5 Fd' chain vith a 25 amino acid SLT linker sequence (20 
amino acids from SLT bounded by cysteine residues, plus 
five amino acids introduced to facilitate cloning) was 
assembled in a three piece ligation from plasmids pING3748, 
pING3217, and a PCR fragment encoding the H65 heavy chain 
10 variable region (Vg) gene segment which is the variable 

region of the Fd' gene in pING3217. Plasmid pING3748 
contains the gelonin gene linked in-frame to the SLT linker 
sequence, and pING3217 contains the H65 Fd' and kappa genes 
in a dicistronic transcription unit. 
15 Plasmid pING3825 (see Example 2) was amplified 

with PCR primers gelo3 ••-iTag and gelo-9 to introduce an Eagl, 
restriction site at the 3* -end of the gelonin gene by PCR 
mutagenesis. 

gelo3»-Eag (SEQ ID NO: 83) 
20 5» CATGCGGCCGATTTAGGATCTTTATCGACGA 3* 

The PCR product was cut with BcII and EagT and the 56 bp 
DNA fragment was purified. Plasmid pING3713 was cut with 
' EagT and Xhol, and the 77 bp DNA fragment containing the 
SLT linker was purified. The 56 bp BcII to EagH fragment 
25 and the 77 bp Eagl to XhoZ fragment were ligated into 

pING3825 which had been digested with BcII and JThoI to 
generate pING3748 which contains the gelonin gene linked 
in-frame to the SLT linker sequence. 

For assembly of the gene fusion vector containing 
30 the Gelonin: : SLT: :Fd' and kappa genes, the H65 Vg was 

amplified by PCR from pING32l7 with primers H65-G1 and H65- 
G2, and the product was treated with T4 polymerase followed 
by digestion with Ndel. 

K65-G1 (SEQ ID NO: 84) 
35 5' AACATCCAGTTGGTGCAGTCTG 3' , 

H65-G2 (SEQ ID NO: 85) 
5« GAGGAGACGGTGACCGTGGT 3» 
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The 176 bp fragment containing the 5 '-end of the H65 heavy 
chain V-region was purified. Concurrently, pING3217 was 
—digested— with Kdel^and-XhoX,. .and._the. -13.07— bp DNA— fragment 
containing a portion of the Fd' gene and all of the kappa 
5 gene was purified. The two fragments were ligated to 

pING3748 which had been digested with Seal and Xhol in a 
three piece ligation yielding pING3754 (ATCC 69102), which 
contains the Gelonin: : SLT: :Fd' and kappa genes. 

' (ii) Gelonin: :SLT: ; kappa (Fd^) 

10 A gelonin gene fusion to the 5* -end of the H65 

kappa chain with the 25 amino acid SLT linker sequence was 
assembled in a three piece ligation from plasmids pING3748 
(see the foregoing section), pING4000, and a PGR fragment 
encoding the H65 light chain variable region (Vl) gene 

15 segment. 

For assembly of the gene fusion vector containing 
the Gelonin: : SLT: :kappa and Fd' genes, an H65 Vl fragment 
was amplified by PGR from pING3217 with primers H65-K1 and 
JKl-l/indlll, and the product was treated with T4 polymerase 
20 followed by digestion with Hindlll. 

H65-K1 (SEQ ID NO: 86) 

5 • GACATCAAGATGACCCAGT 3 » . . 

JKl-Hindlll (SEQ ID NO: 87) 

51 GTTTGATTTCAAGCTTGGTGC 3« 
25 The 306 bp fragment containing the light chain V-region was 
purified. Concurrently, plNG4 000 was digested with Hindlll 
and Xhol, and the 1179 bp DNA fragment containing the kappa 
constant region and all of the Fd' gene was purified. The 
two fragments were ligated to pING374B which had been 
30 digested with Seal and Xhol in a three piece ligation 
yielding pING3757, which contains the Gelonin: : SLT: : kappa 
and Fd genes. 
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(iii) Gelonin: :RMA; ;Fd^ fkappal 

A gelonin gene fusion to the 5 '-end of the H65- 
Fd' chain with the 24 aaino acid RMA linker sequence (20 
amino acids from RMA, plus 4 amino acids introduced to 
5 facilitate cloning) was assembled in a three piece ligation 
from plasmids pING3755, pING3217 and a PCR fragment 
encoding the H65 Vg gene segment. Plasmid pING3755 contains 
the gelonin gene linked in-frame to the RMA linker 
sequence, and pING3217 contains the H65 Fd' and kappa genes 

10 ; in a dicistronic transcription unit. 

Plasmid pING3755 was assembled to contain the 
gelonin gene linked to the RMA linker gene segment. The 
RMA linker gene segment was amplified By PCR from pSH4 with 
primers RMA-iTagl and HJNDIII-2. 

15 RMA-JSagI (SEQ ID NO: 88) 

5 • ACTTCGGCCGCACCATCTGGACAGGCTGGAG 3 • 
HINDIU-'Z (SEQ ID NO: 44) 
5 • CGTTAGCAATTTAACTGTGAT 3 • 
The 198 bp PCR product was cut with Eagl and HindllX, and 

20 the resulting 153 bp DNA fragment was purified. This RMA 

gene segment was . cloned adjacent to gelonin using an PstI 
to Eagl fragment from pING374 8 and the PstI to Hindlll 
vector fragment from pING3825. The product of this three 
piece ligation was pING3755. 

25 For assembly of the gene fusion vector containing 

the Gelonin: : RMA: :Fd' , kappa genes, the H65 was amplified 
by PCR from pING3217 with primers H65-G1 (SEQ ID NO: 84) 
and H65-G2 (SEQ ID NO: 85), and the product was treated 
with T4 polymerase followed by digestion with Ndel. The 

30 186 bp fragment containing the 5 '-end of the heavy chain V- 
region was purified. Concurrently, pING3217 was digested 
with Ndel and Xhol, and the 1307 bp DNA fragment containing 
a portion of the Fd' gene and all of the kappa gene was 
purified. These two fragments were ligated to pING3755 

35 which had been digested with 5cal and Xhol in a three piece 
ligation yielding pING3759 (ATCC 69104), which contains the 
Gelonin: : RMA: :Fd' and kappa genes. 
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(iv) Gelonin; ;RMA: :)cappa f FdO 

A gelonin gene fusion to the 5 '-end of the H65 
Xappa "T:halrr'Vl'th~the 24 amino aTcid—KMA— linker -sequence was 
assembled in a three piece ligation from plasmids pING3755, 
5 pING4 000, and a PCR fragment encoding the H65 gene 

segment. 

For assembly of the gene fusion vector containing 
the Gelonin: :RMA: : kappa and Fd' genes, an H65 Vj^ segment was 
amplified by PCR from pING3217 with primers H65K-1 (SEQ ID 

10 NO: 86) and JKl-lfiudlll, and the product was treated with 
T4 polymerase followed by digestion with Hindlll, The 306 
bp fragment containing the 5'-end of the light chain V- 
region was purified. Concurrently, pING4000 was digested 
with HindlU and Xhol, and the 1179 bp DNA fragment 

15 containing the kappa constant region and all of the Fd' 
gene was purified. These two fragments were ligated to 
pING37 55 which had been digested with Seal and XhoJ in a 
three piece ligation yielding pING375B (ATCC 69103} , which 
contains the Gelonin: :KHA: : kappa and Fd' genes. 

20 C. Direct Fusions Of Gelonin At The 

Amino Terminus Of Antibody Genes 

(i) Gelonin: ;Fd^ f Kappa) 

A direct gelonin gene fusion was constructed from 
pING3754. pING3754 was digested with BgJiI and Xhol and 

25 the vector segment was purified. Concurrently, plNG3754 
was digested with Eagl, treated with T4 polymerase, cut 
with BgJII, and the gelonin gene segment was purified. 
pING3754 was also cut with Fspl and Xhol, and the Fd and 
kappa gene segment was purified. These fragments were 

30 assembled in a three-piece ligation to generate pING3334, 
which contains a direct gene fusion of gelonin to Fd' in 
association with a kappa gene. 
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EXMlPle 15 

Preparation of he3 Fab And Gelonin he3Fab Tmnunofusions 



The sections below detail the construction of 
human-engineering he3Fab protein and immuriofusions of 
5 gelonin to he3 Fd and kappa chains. 
A. he3-Fab Expression Plasmids 

The he3 heavy chain V-region was PCR-amplif ied 
from plasmid pING4621 (pING4621 is fully described above in 
Example 5 above), with primers H65-G3, 

10 r GAGATCCAGTTGGTGCAGTCTG (SEQ ID NO: 116) and H65G2 (SEQ ID 
NO: 85) • Amplification was carried at using vent 
polymerase (New England Biolabs) for 25 cycles, including 
a 94*C denaturation for 1 minute, annd^ling at 50*C for 2 
minutes, and polymerization for 3 minutes at 72'c. The PCR 

15 product was treated with polynucleotide kinase and digested 

with BstEII and the V-region DNA was purified. The 
purified DNA fragment was then ligated into pIClOO, which 
had been digested with 5stl, treated with T4 polymerase, 
and cut with BstEII. The resulting fragment was then 

20 ligated with the BstEII fragment from pING3218 (containing 

Fab' genes) to make pING4623 which contained the he3 Fd 
gene linked to the peiB leader sequence. 

The he3 kappa V-region was next assembled as 
described above in Example 5 and in co-owned, co-pending 

25 U.S. Patent Application Serial No. 07/808,464, incorporated 

by reference herein, using six oligonucleotide primers, 

$H65k-l, AGT CGT CGA CAC GAT GGA CAT GAG GAC CCC 
TGC TCA GTT TCT TGG CAT CCT CCT ACT CTG GTT TCC AGG TAT CAA 
ATG TGA CAT CCA GAT GAC TCA GT (SEQ ID NO: 117); 

30 HUH-K6, TCA CTT GCC GGG CGA ATC AGG ACA TTA ATA 

GCT ATT TAA GCT GGT TCC AGC AGA AAC CAG GGA AAG CTC CTA AGA 
CCC T (SEQ ID NO: 118); 

HUH-K7, TGA CTC GCC CGG CAA GTG ATA GTG ACT CTG 
TCT CCT ACA GAT GCA GAC AGG GAA GAT GGA GAC TGA GTC ATC TGG 

35 ATG TC (SEQ ID NO: 119) ; 
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HUH-K8, GAT CCA CTG CCA CTG AAC CTT GAT GGG ACC 
CCA GAT TCC AAT CTG TTT CCA CGA TAG ATC AGG GTC TTA GGA GCT 
TTC C (SEQ-ID-NO:- 120) ; - 

HUH-K4, GGT TCA GTG GCA GTG GAT CTG GGA CAG ATT 
5 ATA CTC TCA CCA TCA GCA GCC TGCi AAT ATG AAG ATT TTG GAA TTT 

ATT ATT G (SEQ ID NO: 121); and 

HUH-K5, GTT TGA TTT CAA GCT TGG TGC CTC CAC CGA 
ACG TCC ACG GAG ACT CAT CAT ACT GTT GAC AAT AAT AAA TTC CAA 
AAT CTT C (SEQ ID NO: 122) 
10 and amplified with primers HUK-7 (SEQ ID NO: 92) and JKl- 
^indlll (SEQ ID NO: 87). 

The resulting PCR product wqs treated with T4 
polymerase, digested with Hxndlll, arid purified. The 
purified fragment was then cloned into pIClOO, which had 
15 first been cut with Sstl, treated with T4 polymerase, and 

digested with Xhol, along with the 353 bp HindllZ-Xhol 
fragment encoding the kappa constant region from pING3217. 
The resulting plasmid was pING4627 which contains the he3 
kappa sequence linked in frame to the peiB leader. 
20 Plasmid pING4628, containing the pelB-linked he3 

kappa and- Fd genes under transcriptional control of the 
araB promoter, was assembled from pING4623 and pING4627 as 
follows 

TVn expression vector for unrelated kappa and Fd 
25 genes, pN30C-2, was first cut with Saul and JffcoRI, leaving 

a vector fragment which contains all the features relevant 
to plasmid replication, a tetracycline resistance marker, 
araB transcriptional control, and the 3' end of the Fd 
constant region. [Plasmid pNHX-2 comprises an JEcoRI to 
30 Xhol DNA segment from pING 3104 (described in WO 90/02569, 

incorporated by reference herein) . That segment contains 
the replication, resistance and transcription control 
features of pING3104 and is joined to an Xhol to Saul DNA 
segment from pING1444 (described in WO 89/00999, 
35 incorporated by reference herein) which contains the 3' end 

of an Fd constant region.] Next pING4623 was cut with 
PstI, treated with T4 polymerase, digested with Saul and 
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the peIB::Fd gene segment was then isolated. Plasmid 
PING4627 was cut with Xhol, treated with T4 polymerase, cut 
"with j^coRI TnT~li'g"ate~d to th~e pelB r-Fd gene segment "and the 
pNRX-2 vector fragment to generate the he3-Fab expression 
5 vector pING4 628. That plasmid contains two Xhol sites, one 

located between the kappa and Fd genes, and another 4 bp 
downstream of the termination codon for the Fd gene, 

A vector, PING4 633, which lacks the Xhol site 
between the kappa and Fd genes was constructed. To 

10 assemble pING4 633, pING4623 was cut with EcoRT, treated 

with T4 polymerase, digested with Saul. The peJB::kappa 
gene seginent was then isolated and purified. The pNRX-2 
vector fragment and the peIB::Fd gene segment were then 
ligated to the purified pelB: : kappa gene segment to form 

15 PING4 633. 

Both PING4633 and pING4628 are bacterial 
expression vectors for he3-Fab and each comprises the he3 
Fd and Kappa genes which are expressed as a dicistronic 
message upon induction of the host cell with L-arabinose. 
20 Moreover, pING4628 contains two XhoJ restriction sites, one 
located 4bp past the Fd termination codon and one in the 
intergenic region between the 3' end of the Kappa gene and 
the 5' end of the Fd gene. Plasmid plNG4633 lacks the Xhol 
site in the intergenic region. 

25 B- Purification Of he3Fab 

Plasmids pING4 628 and pING4633 were transformed 
into E. coli E104. Bacterial cultures were induced with 
arabinose and cell-free supernatant comprising the he3Fab 
was concentrated and filtered into 20 mm HEPES, pH 6.8. 

30 The sample was then loaded onto a CM Spheradex column (2.5 
X 3 cm), equilibrated in 20 mM HEPEs, 1.5 mM NaCl, pH 6.8. 
The column was washed with the same buffer and eluted with 
20 mm HEPES, 27 mM NaCl, pH 6.8. The eluate was split into 
2 aligiiots and each was loaded onto and eluted from a 

35 protein G (Bioprocessing) column (2.5 x 2.5 cm) separately. 

The protein G column was equilibrated in 20 mM HEPES, 75 MM 
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NaCl, pH 6.8 and the sample was eluted with 100 mM glycine, 
100 mM NaCl, pH 3.O. The two eluates were cojobined and 
diluted two times with 20 mM HEPES, 3 M ammonium sulfate, 
pH 6.8. The diluted eluates were loaded onto phenyl 
5 sepharose high substitution Fast Flow (Pharmacia) column 
(2.5 X. 3.3 cm), equilibrated n 20 mM HEPES, 1.5 M ammonium 
sulfate, pH 6.8. The column was then eluted with 20 mM 
HEPES, 0.6 M ammonium sulfate, pH 6.8. 

*C. Gelonin; :RMA: :he3Kappa. he3Fd Fusions 
10 Other genetic constructs were assembled which 

included a natural sequence gelonin gene fused to an he3- 
Fab via a linker. 

A fusion comprising Gelonin: :RMA: :he3Kappa, Fd 
was assembled from DNA from plasmids pING3755, pING4 633, 

15 and pING4 628. Both pING4633 and pING4628 were assembled in 

a series of steps whereby the he3 heavy and light V-regions 
were individually linked in-frame to the pelB leader. The 
heavy and light V-regions were then placed together in a 
dicistronic expression vector under the control of the araB 

20 promoter in a bacterial expression vector. 

Assembly of the Gelonin: :RMA: :he3Kappa, he3Fd 
fusions was accomplished by; constructing three DNA 
fragments from plasmids pING3755, pING4 633, and pING4 628. 
The first such fragment was made by digesting pING3755 with 

25 5cal and Xhol which excises the 4bp between those sites. 

The resulting vector fragment was purified* The second 
fragment for use in constructing the above fusions was 
obtained from plasmid pING4633, which was cut with Asel 
(which cuts in Vl) and XhoJ and the resulting 1404 bp 

3 0 fragment, containing the 3' end segment of the Kappa and Fd 

genes, was purified. The third fragment, comprising the 5' 
end of the Kappa variable region coding sequence, was 
prepared from the PGR amplified Vl gene contained in 
pING4 628 using the oligonucleotide primers, Huk-7 and jkl- 

35 ffindlll. The resulting 322 bp PCR-amplif ied V,, fragment was 
treated with T4 polymerase, digested with Asel, and the 86 
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bp fragments containing the 5' end of Vj. was purified. The 
three fragment produced above were ligated together to form 
pING3764 . The DNA sequence of the PGR amplified V~region 
was verified by direct DNA sequencing of pING37€4. 

5 D. Gelonin: : SLT: :he3Kappa > he3Fd Fusion 

A Gelonin: : SLT: :he3Kappa, he3Fd fusion was 
constructed by ligating the pING4633 and pING4628 fragments 
described in section A immediately above with a fragment 
produced from pING3748 which contains Gelonin: : SLT- The 
10 plNG374 8 fragment was produced using Seal and Xhol as 

described immediately above for pING3755. The resulting 
vector was designated pING3763. * 

E. Construction of Expression Vector Containing 
Gelonin: ; SLT: ;he3Fd. he3kappa Fusions 

15 An expression vector containing the 

Gelonin: : SLT: :he3Fd, he3kappa fusion was constructed in two 
steps form DNA segments from plasmids pING3825, pING4628, 
PING4639, PING3217 [described in Better, et al., Proc. 
Natl. Acad. Sci. (USA) ^ 90:457-461 (1993), incorporated by 

20 reference herein], and pING4627. pING3825 was digested 

with Ncol and Xhol, generating a 654 bp fragment containing 
the 3' end of the geldnin gene and a fragment containing 
the 5' end of the gelonin gene which were purified. Next, 
PING4 63 9 was digested with Ncol and Ndel and the 9 03 bp 

25 fragment containing the 3' end of the Gelonin gene, the SLT 

linker, and the 5' end of Vh which resulted was purified. 
Finally, pING4 628 was cut with Ndel and Xhol, resulting in 
a 523 bp fragment containing the 3' end of the Fd gene 
which was purified. The three fragments were then ligated 

30 to form plasmid pING3765 which contains a gene encoding a 
gelonin: : SLT: :he3Fd fusion. 

Three vector fragments were used to assemble the 
final expression vector (containing the gelonin: : SLT: :he3Fd 
and he3 kappa segments). Plasmid pING3765 was digested 

35 with Xhol, treated with T4 polymerase, cut with Nhel (which 

releases a 265 bp fragment encoding the tetracycline 
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resistent marker) , and the resulting vector fragment was 
purified, Plasmid pING4627, which contains the heBKappa 
gene -linked- dn^frame to the pelB leader was used for the 
construction of pING4628. Plasmid pING4 627 was cut with 
5 Pstl, treated with T4 polymerase, and further digested with 

Sstl. The resulting 726 bp fragment, containing the Kappa 
gene (except 40 bp at the 3' end) was purified. Plasmid 
pING3217 was then cut with 5stl and Nhel, resulting in a 
318 bp fragment containing the 3' end of the Kappa gene and 
10 'downstream portion, including a portion of the tetracycline 
resistance gene, which was purified. Ligation of the 
foregoing three fragments produced the final expression 
vector, pING37 67, 

F» Construction Of Expression Vector 
15 Containing Gelonin; :RMA: :he3Fd Fusions 

Gelonin: :RMA:he3Fd, he3Kappa fusion expression 
vectors was constructed in two steps from plasmids 
PING3825, pING4628, pING3217, and pING4627. The cloning 
scheme used was identical to that used to generate pING3767 
20 except that pING4638 was substituted for plNG4639. Plasmid 

PING4638 differs from plNG4639 as described below in 
Example 16. The intermediate vector encoding the 
Gelonin: :RMA: :Fd fusion was designated pING3766 and the 
final expression vector was designated pING3768. 

25 Example 16 

Gelonin-Single Chain Antibody Fusions 

The natural sequence gelonin gene was also fused 
to a single chain form of the human engineered he3 H65 
variable region. The gelonin gene was positioned at either 
30 the N-terminus or the C-terminus of the fusion gene and the 
SLT or RMA linker peptide was positioned between the 
gelonin and antibody domains to allow intracellular 
processing of the fusion protein with subsequent cytosolic 
release of gelonin. 
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A. Construction of Gel : :RMA: : SCA(Vl-Vh) ; Gel : : SLT: : SCA 
fV,-V«K Gel; :RMA: ;SCA(Vn-V.) , and Gel ; ;SLT; tSCAfVt.-V,^^ 

A single chain antibody (SCA) form of the he3 H65 

variable domain was assembled from previously constructed 

5 genes. This SCA segment consisted of the entire V and J 

region of the one chain (heavy or light) linked to the 

entire V and J segment of the other chain (heavy or light) 

via a 15 amino acid flexible peptide: [(Gly)^ SerJa. This 

peptide is identical to that described in Huston et al., 

10 Proc. Natl. Acad. Sex. USA, 85:5879-5883 (1988); 

Glockshuber et al.. Biochemistry, 25:1362-1367 (1990); and 
Cheadle et al.. Molecular Jjnjuunol., 29:21-10 (1992). The 
SCA was assembled in two orientations: V- 
Jkapp.2 = [ (Gly),Ser]3: :V-Jg^, and V-Jg^.: : [ (Gly) .SerJa: rV-J^.p^.. 

15 Each SCA segment was assembled and subsequently fused to 

gelonin. 

For assembly of the SCA segment V- 
Jk.pp.: • [ (Gly)4Ser]3: :V-JGa„a«, primers HUK-7 and SCFV-1 were 
used to amplify a 352 bp DNA fragment containing the he3 
20 V/J kappa sequences from pING4627 by PCR in a reaction 

containing 10 mM KCl, 20 mM TRIS pH 8 . 8 , 10 mM (NH^)2S02, 2mM 
MgSO*, 0.1% Triton X-100. , 100 ng/ml BSA, 200 iiM of each 
dNTP, and 2 Units of Vent polymerase (New England Biolabs, 
Beverley, Massachusetts) in a total volume of 100 /il. 
25 SCFV-1 (SEO ID NO: 91) 

5' CGGACCCACCTCCACCAGATCCACCGC . 

CACCTTTCATCTCAAGCTTGGTGC 3« 

HUK-7 (SEQ ID NO: 92) 

5" GACATCCAGATGACTCAGT 3* 
30 Concurrently, primers SCFV-2 and SCFV-3 were used to 

amplify a he3 heavy chain V/J gamma segment from pING4 623, 
generating a 4 00 bp fragment. 
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SCFV-2 (SEQ ID NO: 93) 

5 • GGTGGAGGTGGGTCCGGAGGTGGAGGATCTGA 
_GATC.CAG.TIGG.TGCAG3l_ 3J : 



SCFV-3 (SEQ ID NO: 94) 

5 5 • TGTACTCGAGCCCATCATGAGGAGACGGTGACCGT 3 • 

The products from these reactions were mixed and amplified 
with the outside primers HUK-7 and SCFV-3. The product of 
this reaction was treated with T4 polymerase and then cut 
with Xhol. The resulting 728 bp fragment was then purified 

10 "by electrophoresis on an agarose gel* This fragment was 
ligated into the vectors pING3755 and pING3748 (see Example 
10), each digested with Seal and Xhol, The resulting 
vectors pING4637 and pING4412 contain the Gelonin: :RMA: : SCA 
V-Jk.pp.: : [ (Gly)^ser]3: :V-Jg^, and Gelonin: :SLT: :SCA 

15 V-Ji.pp.: : [ {Gly)4Ser]3: :V-Jg^. fusion genes, respectively. 

The 728 bp fragment was also ligated into pIClOO previously 
digested with Sstl , treated with T4 polymerase and digested 
with XhoJ, to generate pING4 635. This plasmid contains the 
peiB leader sequence linked in-frame to the 

20 V-JKapp.:: [(Gly)4Ser]3: :V-JJ^^. gene. The pelB::SCA gene in 

PING4 635 was excised as an EcoRj-XhoJ restriction fragment 
and cloned into the bacterial expression vector to generate 
PING4 64 0. 

Similarly, the SCA V-J^,^.: : [ (Gly) ,Ser]3: : V-J^.pp. 
25 was assembled by amplification of plNG4 627 with primers 
SCFV-5 and SCFV-6 generating a 367 bp fragment containing 
he3 V/J Xappa sequences, 

SCFV-5 (SEQ ID NO: 95) 

5 • GGTGGAGGTGGGTCCGGAGGTGGAGGATCT 
30 GACATCCAGATGACTCAGT 3» 

SCFV-6 (SEQ ID NO: 96) 

5 » TGTACTCGAGCCCATCATTTCATCTCAAGCTTGGTGC 3 • 

and plNG4 623 with primers H65-G3 and SCFV-4 generating a 
385 bp fragment containing he3 gamma V/J sequences by PGR 
35 with Vent polymerase. 

H65--G3 (SEQ ID NO: 97) 

5' GAGATCCAGTTGGTGCAGTCTG 3* 
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SCFV-4 (SEQ ID NO: 98) 
5' CGGACCCACCTCCACCAGATCC 
~ ACCGCCACCTGAGGAGACGGTG^ ' 
The products from these reactions were nixed and amplified 
5 with H65-G3 and SCFV-6. The 737 bp product was treated 

with T4 polymerase and cut with Xi^ol. Ligation into 
pING3755 and pING3748 (digested with Seal and Xhol) 
resulted in assembly of the Gelonin: :RMA: :SCA V- 
JoMna- • t (Gly)*Ser]3: :V-Jk«pp^ gene fusion in pING4638 and 
10 '- Gelonin: :SLT: :SCA V~Jg«^,:. : [ (Gly) ^Ser^g: : V-Jk.pp, gene fusion 
in pING4 639, respectively. 

The vectors pING4637^ pING4412, pING4 638 and 
pING4 639 were each transformed into E. coli strain E104 and 
induced with arabinose. Protein products of the predicted 
15 molecular weight were identified by Western blot with 

gelonin-specif ic antibodies. 

B. Construction of 

SCA(V.-Vt,V : I SLT: : Gelonin Vectors 

The expression vector containing SCA(Vi^- 

20 Vh) : :SLT: : Gelonin fusions was assembled using restriction 
fragments from previously-constructed plasmids plKG4 640 
(containing SCA (Vl^Vh) ) pING4 4 07 (containing 
Kappa: :SLT: :Gelonin, Fd) , and pING3197. Plasmid pING4640 
was first cut with iSspHI, filled in with T4 polymerase in 

25 the presence of only dCTP, treated with mung bean nuclease 

(MBN) to remove the overhang and to generate a blunt end, 
and cut with EcoJlI. The resulting 849 bp fragment was 
purified. The SLT-containing fragment from pING4407 was 
excised by cutting with EagZ, blunted with T4 polymerase, 

30 cut with Xhol, and the approximately 850 bp fragment which 
resulted was purified. The two fragments were ligated 
together into pING3197, which had been treated with EcoRl 
and Xhol to generate plNG4 642. The DNA sequence at the 
BspHI-T4-MBN/iragI junction revealed that two of the 

35 expected codons were missing but that the fusion protein 

was in frame. 
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C. Construction of 

SCAfVp-Vy) : :SLT: rGelonin Vectors 

The expression vector containing the SCA(Vh~ 
Vj,) ; :SLT: :Gelonin fusions was assembled using DNA from 
5 plasmids pING4 636, (the E. coli expression vector for 
SCA(Vh-Vl)) and piNG4407. Plasmid pING4636 was cut with 
BstEll and Xhol and the resulting vector fragment was 
purified. Concurrently, pINg4 636 was used as a template 
for PCR with primers SCFV-7, 

10 5'TGATGCGGCCGACATCTCAAGCTTGGTGC (SEQ ID NO: 112) and H65- 

G13, TGATGCGGCCGACATCTCAAGCTTGGTGC3 ' (SEQ ID NO: 113). The 
amplified product was digested with Eag'L and BstEII and the 
resulting approximately 380 bp fragment was purified. 
Plasmid pING44 07 was then cut with J?agl and Xhol, resulting 

15 in an approximately 850 bp fragment, which was purified. 

The three above fragments were ligated together to produce 
PING4643. 

D. Construction of 

SCAfV,-VH^ : :RMA: rGelonin Vectors 

20 Expression vectors containing SCA(Vl- 

Vg) : :RMA: :Gelonin fusions were assembled using DNA from 
pING4640, pING4408 [Example 14A(iii)], and pING3825 
(Example 2C) . Plasmid plNG4640 was cut with Sail and 
BstEII and the resulting approximately 7 00 bp vector 

25 fragment (containing the tetracycline resistance matter) 
was purified. Next, pING3825 was digested with Ncol and 
Sail, resulting in an approximately 1344 bp fragment 
containing the 3' end of the gelonin gene and adjacent 
vector sequences. That fragment was purified. Plasmid 

30 pING4408 was then PCR amplified with oligonucleotide 

primers , RMA-G3 5 ' TCTAGGTCACCGTCTCCTCACCATCTGGACAGGCTGGA3 ' 
(SEQ ID NO: 114), and gelo-10. The resulting PCR product 
was cut with BstEII and Wcol to generate an approximately 
180 bp fragment containing the 3' end of Vg, RMA, and the 5' 

35 end of the Gelonin gene which was purified. The above 
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three fragments vere ligated to generate the final 
expression vector, pING4644. 



£• Construction of 

SCA(Vp-V, ) : ;RMA: iGelonin Vectors 

5 Expression vectors containing SCAiV^- 

Vl) : :3^: :Gelonin were constructed using DNA f rom pING 4636, 
PING441Q, and PING3825. Plasmid pING4636 Was digested with 
Sail and HindlU and the resulting vector fragment was 
. purified. Next, pING3825 was cut with Wcol and Sail and 

10 the 1344 bp fragment which resulted contained the 3' end of 
the gelonin gene and adjacent vector sequences encoding 
tetracycline resistance was purified Finally, pING4 4io 
was PGR amplified with primers RMA-G4 , 
5'TTCGAAGCTTGAGATGAAACCATCTGGACAGGCTGGA3' (SEQ ID NO: 115) 

15 and gelo-io. The PGR product was cut with i/indlll and 

i^col, resulting in a 180 bp fragment containing the 3'end 
of Vl, 'PMA, and the 5' end of Gelonin and was purified. The 
three above fragments were ligated together to generate the 
final expression vector, pING4645. 

Gelonin: :SCA fusions without a cleavable linker 
may be constructed by deletion of the SLT linker in 
pING4412 using the restriction enzymes ^agl and Fspl. 
Digestion at these sites and religation of the plasmid 
results in an in-frame deletion of the SLT sequence. 

25 Example 17 

Multivalent Immunof usions 

Multivalent forms of the immunof usions may be 
constructed . 

A. Construction of (Gel: :RMA: : kappa, Fd')2 
30 and fGel: rRMA; :Fd^ , kaopaU Expression Vectnrc; 

Bacterial Fab expressionn vectors can result in 

the production of F(ab')2 if the two cysteine residues from 

the human IgGl hinge region are included into the car boxy 1- 

terminus of the Fd protein [Better et aj,. Proa. Natl. 

35 Acad. Sci . USA, ^0:457-461 (1993)], To express a gelonin 
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fusion protein that could form .a bi-valent: structure like 
an F(ab')2# the he3 Fd' {2C) hinge region (Better et al., 
supra) was cloned into the expression vector pING3764 
(Example 15C) encoding the fusion protein Gel: :RMA: : kappa, 
5 Fd. 

Plasmid pING3764 was cut with Xhol and jBsu36I and 
the approximately 7500 bp fragment containing the 
immunofusion gene and vector sequences was purified. 
Plasmid pING4629, which encodes he3 F(ab')2, was also cut 

10 with jBsu36I and Xhol, and the approximately 200 bp DNA 

fragment containing the he3 Fd' (2C) gene segment was 
purified. These two DNA fragments were Jigated to generate 
PING3775 encoding (Gel: :RMA: :kappa, Fd')2. An expression 
vector encoding the fusion protein (Gel: :RMA: :Fd' , kappa)2 

15 was also made. 

B. Construction of Vectors Containing Botli Gel::RMA::Fd 
and Gel::RMA::K Fusions 

In order to construct a plasmid comprising 
Gel::RMA::Fd and Gel::RMA::k fusions, plasmid pING3764 

20 [described above in Example 15(b)) was digested with BsgJ 

and Saul and a 5.7 kb vector fragment containing plasmid 
replication functions, Gel::RMA::k, and the 3' end of Fd 
was isolated and purified. Plasmid plNG3768 [described 
above in Example 15(E)] was digested with Saul and 

25 partially digested with PstI and a 1.5 kb fragment 

containing Gel::RMA::Fd was purified. Finally, pING4000 
[described above in Example 14] was digested with BsgX and 
PstI, generating a 350 bp fragment containing the 3' end of 
the kappa gene. That fragment was purified and the 5.7 kb, 

3 0 1.5 kb, and 350 kb fragments described above were ligated 

together to form pING3770, containing the gelonin: :RMA: :k 
and gelonin : HMA : :Fd fusions. 
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C. Construction of Vectors Containing Both Gel::SLT::Fd 
and Gel::SLT::k Fusions , 

Plasmid pING3772 contains the above-entitled 

fusions and was constructed as follows. Plasmid pING3763 

5 [described above in Example 15(D) ] was digested with Bsgl 

and Saul and a 5.7 kb fragment containing the replication 

functions, the 5' end of Gel::SLT::k and the 3' end of Fd 

was generated and purified. Next, plasmid pING3767 

[described in Example 15(D) above] was digested with 5aul 

10 ^ and PstI, generating a 1.5 kb fragment containing the 5' 
end of the gel:;SLT::Fd fusion. That fragment was purified 
and pING4 000 [described in Example 14 above] was digested 
with Bsgl and Pstl. The resulting ^50 bp fragment was 
purified and the above-described 5,7 kb, 1*5 kb, and 350 bp 

15 fragments were ligated to form pING3772. 

D. Expression of Multivalent Fusions 

Both pING3770 and pING3772 were transformed into 
E» coll (E104) cells by techniques known to those of 
ordinary skill in the art and induced with arabinose. 

2 0 Concentrated supernatants from the transformed cell 

cultures were analyzed by Western blot analysis with rabbit 
anti-gelonin antiserum. Transf ormants from both plasmids 
generated a reactive band on the gel at the size expected 
for a Fab molecule carrying two gelonins (approximately 105 

25 kD) . These results are consistent with the production of 

fusion proteins comprising monovalent Fab, with both Fd and 
kappa chains separately fused to gelonin. 

coll strains containing plasmids pING3775/ 
pING3770 and plNG3772 were grown in fermenters and the 

30 fusion protein products were purified. The 
(Gel: :RMA: :kappa,Fd' )2 expressed from pING3775 was purified 
as described in Better et al., supra. 
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Egample 18 

Construction of Expression Vectors 
Encoding ImmunefusAons-Without Linkers - 

Expression vectors encoding direct fusions of 
5 gelonin and dicistronic he3 Fab protein or single chain 
antibody were constructed as follows. 

A. V^V^; :Gel 

Plasmid pING4 642 (Example 16B) which encodes the 
Vi,Vb: :SLT: :Gel fusion protein was cut with Fspl and Ncol, 

10 and the approximately 100 bp DNA fragment containing the 

5»-end of the gelonin gene was purified. Plasmid pING4643 
(Example 16C) , which encodes the Vgyi.: : SLT: :Gel fusion 
protein, was cut with Eagl, treated with T4 polymerase and 
cut with Pstl. The approximately 850 bp DNA fragment 

15 encoding the VbVj, gene segment was purified. The DNA 

fragments from pING4642 and pING4 64 3 were ligated into the 
vector DNA fragment from pING4644 (Example 16D) [THIS 
PLASMID CONTRIBUTES THE REST OF THE GELONIN GENE THEN?] 
that had been cut with PstI and Ncol to generate pING3781, 

20 which encodes the VaV^rrGel direct gene fusion. 

B. V,Vn; :Gel 

Plasmid pING4640 which encodes the he3 SCA gene 
Vj^Vfl was cut with BspHI, treated with T4 polymerase in the 
presence of the nucleotide dCTP only, treated with mung 
25 bean nuclease to remove the remaining 5' overhang, and then 
cut with JETcoRI. The approximately 800 bp DNA fragment 
containing the he3 V^Vh gene was then purified on an agarose 
gel. 

Plasmid pING3781 which encodes the direct fusion 
30 VflV^rrGel was digested with Eagl, treated with T4 
polymerase, and then digested with XhoJ. The approximately 
800 bp DNA fragment encoding the gelonin gene was then 
purified on an agarose gel. The two DNA fragments from 
PING464 0 and pING3781 were ligated into the vector DNA from 
35 PING3767 which had been digested ^coRI and Xhol and 
purified on an agaraose gel. The resultant plasmid, 
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pING3348, encoded the VlVh: :Gel fusion protein. The DNA 
sequence at the fusion junction was verified by direct DNA 
sequencing. 

C. Gel; rVff V , 

5 The plasmid pING3755 [Example 14B(iii)], which 

contains the gelonin gene with an engineered Eagl site at 
its 3 '-end, was cut with ^agrl, treated with T4 polymerase, 
and digested with Ncol. The approximately 650 bp DNA 
fragment containing the 3 '-end of the gelonin gene was 
10 purified on an agarose gel. The plasmid pING4 639 (Example 

16A) encoding the fusion Gel: : SLT: : VhVl was cut with Xhol 
and then partially digested with Fspl* The approximately 
730 bp DNA fragment containing all of the he3 VhVl gene was 
then purified in an agarose gel (a single Fspl restriction 
15 site occurs in the Vg gene segment, and the purified he3 VhV,^ 

gene was separated from the incomplete gene segment which 
was approximately 660 bp) . The two DNA fragments from 
PING3755 and pING4639 were ligated into the vector pING3825 
that had been digested with i^col plus Xhol and purified on 
20 an agarose gel. The plasmid pING3350 was generated which 

encoded the GeliiVaVL fusion protein* The DNA sequence at 
the fusion junction was verified by direct DNA sequencing. 

D. Gel: :ViV, 

Plasmid pING3336 which encodes the he3 V{V^ single 
25 chain antibody gene was cut with SstI and Asel, and the 
approximately 5500 bp DNA fragment containing the 3 '-end of 
V^Vg and downstream vector sequences was purified. 
(PING3336 is identical to pING464 0 except that the VlVq gene 
encodes six histidine residues in frame at the carboxyl- 
3 0 terminus) . Plasmid PING4 627 (Example 15A) served as a 
substrate for PGR amplification of the Vh gene segment. 
Plasmid pING4627 was amplified with the two oligonucleotide 
primers HUK-7 (SEQ ID NO: 92) and JKl-Hindlll (SEQ ID NO: 
87), the resultant product was treated with T4 polymerase 
35 and cut with Asel, and the 86 bp DNA fragment containing 
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the 5'~end of the Vl was purified. The DNa fragments from 
pING3336 and pING4627 were ligated to the approximately 
.23-50_bp— DNA— fragment- -of _pXNG3-7^ 5 generated- • by digestion 
with EagI, treatment with T4 polymerase and subsequent 
digestion with Sstl. The resultant vector containing the 
GelziViV^ gene fusion was named pING4652. The DNA sequence 
of pING4 652 was verified at ligation juctions. 

E. Gel::Xappa^ Fd 

The direct gene fusion which encodes Gel::kappa, 
Fd was also assembled from DNA segments froin three 
plasmids. Plasmid pING3764 (Example ISC^ was digested with 
Hindlll and XhoJ, and the approximately 1200 bp DNA 
fragment encoding the 3»-end of the kappa gene and the Fd 
gene was purified. Plasmid pING4652, which encodes a 
direct gene fusion of gelonin to the he3 SCA gene VlVb, was 
cut with Bgrlll and HindJIl , and the approximately 850 bp 
DNA fragment encoding the 3 '-end of the gelonin gene and 
the region of kappa was purified. The DNA fragments from 
PING3764 and pING4652 were ligated into the vector fragment 
from pING3825 (Example 2C) that had been digested with 
Bgill and Xhol to generate pING3784 encoding Gel: : kappa ^ 
Fd. 

F. Gel : : Fd . kappa 

Plasmid pING3768 (Example 15F) , which encodes the 
fusion protein Gel: :RMA: :Fd, kappa, was cut with Ndel and 
Nhel, and the DNA segment containing the majority of the 
he3 Fd gene, the he3 kappa gene and a portion of the 
tetracycline resistance gene of the vector was purified. 
Plasmid pING3350, which is described in section C above, 
was cut with Ndel and PstI, and the DNA fragment containing 
the 5 •-end of the he3 Fd gene linked to the gelonin gene 
was purified. The DNA fragments from pING3 350 and pING3768 
were ligated into the vector fragment from pING4633 
(Example 16D) that had been cut with Nhel and Pstl to 
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generate PING3789. Plasmid pING3789 encodes the fusion 
protein Gel::Fd, kappa ♦ 



Example 19 
Alternative Cathepsin Cleavable Linkers 

The segment of rabbit muscle aldolase chosen for 
the RMA linker described herein is known to contain peptide 
sequences susceptible to digestion with cathepsins. Other 
cathepsin-cleavable protein segments are effective targets 
^ for intracellular cleavage, and two particular amino acid 
sequences were included as cleavable linkers in additional 
immunofusions of the invention. These are the amino acid 
sequence KPAKFFRL (SEQ ID NO: 141 ("CCF«) and KPAKFLRL (SEQ 
ID NO: 142) ("CCL") . Two oligonucleotides were synthesized 
that encode these peptide segments. Degeneracy was 
introduced at one nucleotide position in each synthetic 
primer to allow the appropriate amino acid to be encoded at 
the particular amino acid position in which CCF and. CCL 
differ. The two oligonucleotides 5'- 

GGCCGCAAAGCCGGCTAAGTTCTT(A/C)CGTCTGAGT-3' (SEQ ID NO: 143) 
and 5'-ACTCAGACG(G/T)AAGAACTTAGCCGGCTTTGC-3' (SEQ ID NO: 
144). The oligonucleotide linkers were then used to 
assemble a family of fusion gene expression vectors 
encoding: Gel: :CCL: :kappa, Fd; Gel: :CCF: :kappa, Fd; 
Gel: :CCF: :VLVg; and Gel: :CCL: :VgVL. 

The CCL and CCF linkers were also included 
in fusion vectors where the antigen-binding domain of the 
fusion protein was at the N-terminus of the fusion to 
generate expression vectors encoding immunofusions such as 
VlVh: :CCL: :Gel. 

Several of the fusion proteins with the CCL 
and CCF linkers were tested for cytotoxicity on the T cell 
lines HSB2 and PBMC and were comparable in activity to the 
fusion proteins containing the RMA linker. 
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Example 2 0 

Eypression And Purification Of Gelonin Imnunofusions 
A. Expression Of Gelonin Inununofusions 

Each of the gelonin gene fusions whose 
5 construction is described in Example 15 was co-expressed 
with its pair H65 Fab gene in arabinose-induced E. coli 
strain E104, 

Expression products of the gene fusions were 
detected in the supernatant of induced cultures by ELISA. 

10 Typically, a plate was coated with antibody recognizing 
gelonin. Culture supernatant was applied and bound Fab was 
detected with antibody recognizing human kappa coupled to 
horseradish peroxidase • H65 Fab fragment chemically 
conjugated to gelonin was used a standard. Alternative 

15 ELISA protocols involving coating a plate with antibody 
recognizing either the kappa or Fd or involving a detection 
step with anti-human Fd rather than anti-human kappa 
yielded similar results. Only properly assembled fusion 
protein containing gelonin, kappa and Fd was detected by 

20 this assay. Unassociated chains were not detected. 

The fusion protein produced from induced cultures 
containing expression vectors pING4406, 4407, 4408, and 
4410 in E. coli E104 accumulated at about 20-50 ng/ml. The 
fusion proteins expressed upon induction of pING3754, 3334, 

25 3758 and 3759 (but not pING3757) were expressed at much 
higher levels, at about 100 to 500 ng/ml. A fusion protein 
of about 70,000 Kd was detected in the concentrated E. coli 
culture supernatant by immunostaining of Western blots with 
either anti-human kappa or anti-gelonin antibodies. 

30 The Gelonin: :SLT: :Fd' (kappa) fusion protein from 

PING3754 (ATCC 69102) was purified from induced 10 L 
fermentation broth. The 10 L fermentation broth was 
concentrated and buffer exchanged into lOmM phosphate 
buffer at pH 7.0, using an SIOYIO cartridge (Amicon) and a 

35 DCIO concentrator. The supernatant was purified by passing 
the concentrated supernatant through a DE52 column (20 x 5 
cm) equilibrated with 10 mM sodium phosphate buffer at pH 
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7.0. The flow-through was then further purified and 
concentrated by colximn chromatography on CM52 (5 x 10 cm) 
in 10 mM phosphate buffer. A 0 - 0.2 M linear gradient of 
NaCl was used to the elute the fusion protein, and 
5 fractions containing the fusion protein were pooled and 

loaded onto a Protein G column (Iml) . The fusion protein 
was eluted from protein G with 0.2 M sodium citrate, pH 5.5 
and then 0.2 M sodium acetate, pH 4.5, and finally, 0.2 M 
glycine, pH 2.5. The Gelonin: :RMA: :Fd' (kappa) and 

10 - Gelonin: :RMA: : kappa (Fd') fusions proteins were purified 
from fermentation broths by similar methods except that the 
CM52 column step was eliminated, and the DE52 ccriumn was 
equilibrated with lOOmM sodium phosphatTp buffer at pH 7-0. 
The fusion proteins were not purified to homogeneity. 

15 Each of the three purified fusion proteins was 

then assayed for activity in the RLA assay and for 
cytotoxicity against the T-cell line HSB2. (T cells 
express the CD5 antigen which is recognized by H65 
antibody.) The RIA assay was performed as described in 

20 Example 4 and results of the assay are presented below in 

Table 12. 



Table 12 

Fusion Protein IC5Q(PM1 

rGelonin 11 

25 Gelonin: :SLT: :Fd (kappa) 19 

Gelonin: :RMA: :Fd (kappa) 28 

Gelonin: :KMA: :kappa (Fd) 10 



Two fusion proteins were tested in whole cell cytotoxicity 
assays performed as described in Example 6 (Table 13). As 
30 shown in Table 13, the fusion proteins were active. 

Gelonin: :SLT: :Fd (kappa) killed two T cell lines, HSB2 and 
CEM, with respective IC50S 2-fold (HSB2) or 10-fold (CEM) 
higher than that of the gelonin chemically linked to H65. 
See Table 13 below for results wherein IC50 values were 
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adjusted relative to the amount of fusion protein in each 
sample. 

Table 13 
IC50 (PMT) 

5 Fusion Protein HSB2 Cells CEM Cells 

he3Fab-GelA50(C4o 

Gelonin: :SLT: :Fd (kappa) 180 1007 

Gelonin: :RMA: :Fd (kappa) 150 NT 

These fusion protein showed similar activity on peripheral 
10 blood mononuclear cells (data not shoup) • 

B. Purification of Imrounofusions 

(i) Immunofusions Comprising cH65Fab^ 
Immunof usions comprising a cH65Fab' fragment were 
purified from cell-free supernatants by passing the 

15 supernatant through a CM Spheradex (Sepacor) coluinn (5cm x 

3cm), equilibrated in 10 Mm Na phosphate at pH 7.O. 
Immunofusion proteins bind to the column and are eluted 
with 10 mM Na phosphate, 200 mM NaCl, pH 7.0. The eluate 
was diluted two-fold with 20 Mm HEPES, 3 M ammonium 

20 sulfate, pH 7.6 and loaded onto a phenyl sepharose fast 

flow (Pharmacia) column (2.5 x3.5 cm), equilibrated iii 20 
mM HEPES, 1.2 M ammonium sulfate, pH 7.0. The column was 
next washed with 20 mM Hepes, 1.2 M ammonixm sulfate, pH 
7.0 and eluted with 20 mM HEPES, 0.9 M ammonium sulfate, pH 

25 7.0. The phenyl sepharose eluate was concentrated to a 

volume of 2-4 ml in an Amicon stirred cell fitted with a 
YMIO membrane. The concentrated sample was loaded onto an 
S-200 (Pharmacia) column (3.2x 38 cm), equilibrated in 10 
mm Na phosphate, 150 mm NaCl, pH 7.0. The column was run 

30 in the same buffer and fractions were collected. Fractions 

containing the fusion protein of desired molecular weight 
were combined. For example, by selection of appropriate 
column fractions, both monovalent (gelonin-Fab' ) and 
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bivalent (gelonin2-F(ab' )2 forms encoded by pING3758 were 
purified* 

(ii) Immunofusions Comprising he3Fab 
Immunofusions comprising he3Fab were purified as 

5 in the preceding section with the exception that the phenyl 

sepharose column was e luted with 2 0 mM HEPES, 1.0 M 
ammonium sulfate, pH 7*0. 

(iii) iTnmunofusions Comprising SCA 
Cell-free supernatant was passed through a CM 

10 spheradex column (5 x 3 cm) ^ equilibrated with 10 mM Na 

phosphate, pH 7.0* Single-chain aritibody binds to the 
column which is then washed with 10 mM Naphosphate, 4 5 mM 
NaCl, pH 7,0, The fusion protein was then eluted with 10 
mM Naphosphate, 200 mM NaCl, pH 7.0. The eluate was 

15 diluted two-fold with 20 mM HePES, 3 M ammonium sulfate, pH 

7.0 and loaded onto a butyl sepharose Fast Flow (Pharmacia) 
column (2.5 x 4.1 cm) equilibrated in 20 mM HEPES, 1.5 M 
ammonixrm sulfate, pH 7.0. The column was then washed with 
20 mM HEPES, 1.0 M ammonium sulfate, pH 7.0 and eluted with 

20 20 mM HEPES pH 7.0. The butyl sepharose eluate was 

concentrated to a volume of 2-4 ml in an Amicon stirred 
cell fitted with a YMIO membrane. The concentrated sample 
was loaded onto an S-200 (Pharmacia) column (3.2 x 38 cm) 
equilibrated in 10 mM Na phosphate, 150 mM NaCl, pH 7.0. 

25 The column was then run in the same buffer and the 
fractions were collected. Some of the fractions were 
analyzed by SDS-PAGE to determine which fractions to pool 
together for the final product. 

Example 21 

30 Activity of Gelonin Immunofusions 

A concern in constructing immunofusions 
comprising any RIP is that the targeting and enzymatic 
activities of the components of the fusion protein may be 
lost as a result of the fusion. For example, attachment of 
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an RIP to the amino terminus of an antibody may affect the 
antigen-binding (complementarity-determining regions) of 

the— ant-ibody--and~may— also—result in steric hinderance at 

the active site. Similarly, the activity of an RIP may be 
5 hindered by attachment of an antibody or antibody portion. 
For example, RIPs chemically conjugated to antibodies via 
a disulfide bridge are typically inactive in the absence of 
reducing agents. In order to assess the foregoing in 
immunofusions of the present invention, such proteins were 
10 subjected to assays to determine their enzymatic^ binding, 

and cytotoxic activities. 

■ ■ 

A. Reticulocyte Lvsate Assay 

The enzymatic activity of immunofusions 
comprising gelonin was assayed using the reticulocyte 

15 lysate assay (RLA) describe above. As noted in Example 4, 

the RLA assay measures the inhibition of protein synthesis 
in a cell-free system using endogenous globin mRNA from a 
rabbit red blood cell lysate. Decreased incorporation of 
tritiated leucine (^H-Leu) was measured as a function of 

20 toxin concentration. Serial log dilutions of standard 

toxin (the 30 k6 foinoa of ricin A-chain, abbreviated as RTA 
30) , native gelonin, recombinant gelonin (rGelohin or rGel) 
and gelonin analogs were tested over a range of 1 tig /ml 'to 
1 pg/ml. Samples were tested in triplicate, prepared on 

25 ice, incubated for 30 minutes at 37 'C, and then counted on 

an Inotec Trace 96 cascade ionization counter. By 
comparison with an uninhibited sample, the picomolar 
concentration of toxin (pM) which corresponds to 50% 
inhibition of protein synthesis (IC50) was calculated. 

30 Representative data for various immunotoxins of 

the invention are shown below in Table 14 . 
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Table 14 







Iiranunotoxin 


Lot No. 


IC.nfPM) 




rGel: 


:RMA: :SCA(Vh-Vl} 


AB1136 


12 




rGel: 


:RMA: :SCA(Vl-Vh) 


AB1137 


1 8 

JL O 


5 


rGel: 


:SLT: :SCA(Vb-Vl) 


AB1133 


26 




rGel: 


:3^MA: :SCA(Vl-Vh) 


AB1124 


33 




rGel: 


: RMA : : K+Fd ' ( cH65Fab' ) 


AB1122 


54 




rGel: 


:SLT: :K+Fd (he3Fab) 


AB1160 


40 




rGel: 


:RMA: :K+Fd(he3Fab) 


AB1149 


33 


10 


rGel : 


:3WA: :Fd-fK(he3Fab) 


AB1163 


14 




rGel: 


:Fd'+K(cH65Fab') 


AB1123 


45 



Contrary to the expectations discussed above, 
gelonin immunofusions of the invention exhibit enzymatic 
activity which is comparable to the activities of native 
15 and recombinant gelonin shown in Example 4. This was true 

for fusions made with either the reducible (SLT) or non- 
reducible (RMA) linkers. 

B. Binding Activity of Immunofusions 

Several immunofusions according to the present 

20 invention were assayed for their ability to compete with 

labelled antibody for binding to CD5-positive cells. The 
Kd of the immunofusions was estimated by three different 
means as shown in Table 15. The first Kd estimation (Kdj in 
Table 15) was obtained by competition with fluorescein- 

25 labelled H65 IgG for binding to M0LT-4X cells (ATCC CRL 
1582) according to the procedure reported in Knebel, et 
al . , CytomBtry Suppl . , 2 : 68 (1987) , incorporated by 
reference herein. 

The second Kd measurement (Kdz in Tablel5) was 

30 obtained by Scatchard analysis of competition of the 
immunofusion with ^"l-cH65 IgG for binding on M0LT-4M cells 
as follows. A 20 /ig aliquot of chimeric H65 IgG (cH65 IgG) 
was iodinated by exposure to 100 fil lactoperoxidase-glucose 
oxidase immobilized beads (Enzymobeads, BioRad) , 100 ill of 

35 PBS, 1.0 mCi I^" (Amersham, IMS30) , 50 Ml of 55 mM 
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b-D-glucose for 45 minutes at 2 3**C. The reaction was 
quenched by the addition of 20 /il of 105 mM sodium 
metabisulf ite and 120 mM potasisium iodine followed by 
centrifugation for 1 minute to pellet the beads. "^I-cH65 
5 TgG was purified by gel filtration using a 7 ml column of 

sephadex G25, eluted with PBS (137 mM NaCl, 1.47 mM KHiPO^, 
8.1 mM NazHPO^, 2.68 mM KCl at pH 7.2-7.4) plus 0.1% BSA. 
^"l-cH65 IgG recovery and specific activity were determined 
by TCA precipitation. 

Competitive binding was performed as follows: 
100 /il of Molt-4M cells were washed two times in ice-cold 
DHB binding buffer (Dubellco's modified Eagle's mediiun 
(Gibco, 320-1965PJ), 1.0% BSA and 10 *mM Hepes at pH 7.2 
-7.4). Cells were resuspended in the same buffer, plated 
into 96 v-bottomed wells (Costar) at 3 x 10^ cells per well 
and pelleted at 4*C by centrifugation for 5 min at 1,000 
rpm using a Beckman JS 4.2 rotor; 50 /il of 2X-concentrated 
0.1 nM ^"l-cH65 IgG in DHB was then added to each well and 
competed with 50 /xl of 2X - concentrated cH65 IgG in DHB at 
final protein concentrations froin 100 nM to 0.0017 nM. The 
concentrations of assayed proteins were determined by 
measuring absorbance (A2B0 and using an extinction 
coefficient of 1.0 for fusion proteins, 1.3 for Fab, and 
1.22 for Fab conjugated to gelonin. Also, protein 
concentrations were determined by BCA assay (Pierce 
Chemical) with bovine serum albumin as the standard. 
Binding was allowed to proceed at 4*^0 for 5 hrs and was 
terminated by washing c^lls three times with 200 ^1 of DHB 
binding buffer by centrifugation for 5 min. at 1,000 rpm. 
All buffers and operations were at 4*C. Radioactivity was 
determined by solubilizing cells in 100 Ml of 1.0 M NaOH 
and counting in a Cobra II auto gamma counter (Pac)card) . 
Data from binding experiments were analyzed by the weighted 
nonlinear least squares curve fitting program, MacLigand, 
a Macintosh version of the computer program "Ligand" from 
Munson, Analyt. Biochem., 107:220 (1980), incorporated by 
reference herein. 
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Finally, the Kd (Kdg in the Table) was estimated 
by examination of the ED50 values obtained from separate 
competition binding assays, performed as described in the 
previous paragraph. All three measurements are shown in 
5 Table 15 below: 



Table 15 



noiecuxe lype 


Ml 


Kdg 


Kd^ 


H65 IgG 


1.6 


ND 


ND 


CH65 IgG 


ND 


3.0 


2.5 


cH65Pab' 


4.0 


14.0 


ND 


cH65Fab'-rGelA50(C4«) 


3.5 


* 13.0 


ND 


rGel: :RMA: :K+Fd' (cH65Fab' ) 


16.0 


ND 


100 


he3Fab 


1.20 


2.60 


ND 


he3Fab~rGelA50(C44) 


1.10 


2.70 


ND 


rGel: :ra4A: :K-fFd' (he3Fab) 


2.60 


ND 


5.0 


rGel : : SLt : : K+Fd (he3Fab) 


ND 


ND 


30 


SCA(Vl-Vh) 


2.20 


ND 


30 


rGel: :RMA: :SCA(VB-Vt) 


3.50 


ND 


20 


rGel: :RMA: :SCA(Vl-Vh) 


4.70 


ND 


30 


SCA(Vl-Vb) 


ND 


ND 


20 


rGel: :RMA: :SCA(Vl^Vh) 


2.30 


ND 


ND 


ND = not determined 









The results presented in Table 15 suggest that 
Fab and SCA antibody forms may retain substantial binding 
25 activity even when fused to an RIP. 

C. Comparative Cytotoxicity Assays 

Fusion proteins and immunocon jugates according to 
the present invention were used in a comparative cytoxicity 
assay. Two types of assays were conducted, one targeting 
30 T cell line HSB2, and the other targeting lectin-activated 
peripheral blood mononuclear cells (PBMC) according to 
procedures in Example 6. The results of the assays are 
presented below in Tables 16a, I6b and 16c. 
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The results presented in Tables 16a, 16b and 16c 
demonstrate that gelonin immunofusions may vary in their 
activity. In general, inmunof usions of the invention which 
have IC50 median or mean values of less than 2000 pM Toxin 
5 display strong activity; whereas those with IC50 values 

equal to or less than 500 pM Toxin are considered highly 
active* In sum, the results in Tables 16a, 16b and 16c 
demonstrate that the optimum fusion protein for killing a 
particular cell line may vary depending upon the targeted 
10 cell. 

Example •^2 

Preparation Of BRIP 

BRIP possesses characteristics which make it an 
attractive candidate for a component of immunotoxins. BRIP 

15 is a naturally unglycosylated protein that may have reduced 

uptake in the liver and enhanced circulatory residence time 
in vivo. Additionally, BRIP is less toxic and less 
immunogenic in animals than the A-chain of ricin. Cloning 
of the BRIP gene and expression of recombinant BRIP in an 

20 E. calx expression system obviates the need to purify 
native BRIP directly from barley, and enables the 
development of analogs of BRIP which may be conjugated with 
an available cysteine residue for conjugation to 
antibodies. 

25 A. Purification Of BRIP And Generation 
of Polyclonal Antibodies To BRIP 

Native BRIP was purified from pearled barley 

flour. Four kilograms of flour was extracted with 16 

liters of extraction buffer (10 mM NaP04, 25 mM NaCl, pH 

30 7.2) for 20 hours at 4'C. The sediment was removed by 

centrif ugation, and 200 ml of packed S-Sepharose 
(Pharmacia, Piscataway, New Jersey) was added to absorb 
BRIP. After mixing for 20 hours at 4'C, the resin was 
allowed to settle out, rinsed several times with extraction 

35 buffer and then packed into a 2.6 x 4 0 cm column. Once 
packed, the column was washed with extraction . buff er (150 
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Bl/h) until the absorbance of the effluent approached zero. 
BRIP was then eluted with a lineiar gradient of 0.025 to 0.3 . 
M NaCl in extraction buffer and 5 ml fractions were 
collected. BRIP-containing peaks (identified by Western 
5 analysis of column fractions) were pooled, concentrated to 
about 20 ml, and then chroma tographed on a 2.6 x 100 cm 
Sephacryl S-200HR (Pharmacia) column equilibrated in 10 mM 
NaPO^, 125 mM NaCl, pH 7.4 (10 ml/hr) . BRIP-containing 
peaks were pooled again, concentrated, and stored at -70 *C. 

10 The resulting purified . BRIP protein had a 

molecular weight of about 30,000 Daltons, based upon the 
mobility of Coomassie-stained protein i?ands following SDS- 
PAGE. The amino acid composition was consistent with that 
published by Asano et al., Carlsberg Res. Comm., 49:619-626 

15 (1984). 

Rabbits were immunized with purified BRIP to 
generate polyclonal antisera. 

B. Cloning Of The BRIP Gene 

A cDNA expression library prepared from 

20 germinating barley seeds in the phage X expression vector 

XZAPII was purchased from Stratagene, La Jolla, CA. 
Approximately 700,000 phage plagues were screened with 
anti-BRIP polyclonal antisera and 6 immunoreactive plagues 
were identified. One plague was chosen, and the cDNA 

25 contained therein was excised from XZAPII with EcoRl and 

subcloned into pUC18 generating the vector pBSl. The cDNA 
insert was sequenced with Sequenase (United States 
Biochemical, Cleveland, Ohio) . The DNA sequence of the 
native BRIP gene is set out in SEQ ID NO: 12. To confirm 

30 that cDNA encoded the native BRIP gene, the cDNA was 

expressed in the coll plasmid pKK233-2 (Pharmacia) • 
BRIP protein was detected in IPTG-induced cells transformed 
with the plasmid by Western analysis with above-described 
rabbit anti-BRIP antisera. 
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C. Construction Of An coli Expression 
Vector containing The BRIP Gene 

Barley cDNA containing the-BRIP gene was linked 

to a pelB leader sequence and placed under control of an 

5 araB promoter in a bacterial secretion vector • 

An intermediate vector containing the BRIP gene 

linked to the pelB leader sequence was generated. Plasmid 

pBSl was cut with Ncol, treated with Mung Bean Nuclease, 

cut with Bamni and the 760 bp fragment corresponding to 

10 . amino acids 1-256 of BRIP was purified from an agarose gel. 

Concurrently, a unique Xhol site was introduced downstream 
of the 3 '-end of the BRIP gene in pBSl by PCR amplification 
with a pUC18 vector primer (identical to the Reverse® 
primer sold by NEB or BRL but synthesized on a Cyclone 

15 Model 8400 DNA synthesizer) and the specific primer BRIP 

3*Xho. The sequence of each of the primers is set out 
below* 

Reverse (SEQ ID NO: 45) 
5» AACAGCTATGACCATG 3' 

20 BRIP 3'Xho (SEQ ID NO: 46) 

5' TGAACTCGAGGAAAACTACCTATTTCCCAC 3» 
Primer BRIP 3 *Xho includes a portion corresponding to the 
last 8 bp of the BRIP gene, the termination codon and 
several base pairs downstream of the BRIP gene, and an 

25 additional portion that introduces a Xhol site in the 

resulting PCR fragment. The PCR reaction product was 
digested with BamHI and Xhol, and an 87 bp fragment 
containing the 3»-end of the BRIP gene was purified on a 5% 
acrylamide gel. The 760 and 87 bp purified BRIP fragments 

30 were ligated in the vector pING1500 adjacent to the pelB 

leader sequence. pINGlSOO had previously been cut with 
SstI, treated with T4 polymerase, cut with Xhol, and 
purified. The DNA sequence at the junction of the pelB 
leader and the 5 •-end of the BRIP gens was verified by DNA 

35 sequence analysis. This vector was denoted pING3321-l. 

The final expression vector was assembled by 
placing the BRIP gene under the control of the inducible 
araB promoter. Plasmid pING3321-l was cut with PstI and 
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Xhol, and the BRIP gene linked to the pelB leader was 
purified from an agarose gel. The expression vector 
._pIN.G3217, containing the araB promoter, was cut with PstI 
and Xhol and ligated to the BRIP gene. The expression 
5 vector was denoted pING3322. 

Arabinose induction of JET. coJi cells containing 
the plasmid pING3322 in a fermenter resulted in the 
production of about 100 mg per liter of recombinant BRip. 
E. coli^produced BRIP displays properties identical to BRIP 
10 " purified directly from barley seeds. 

D. Construction Of BRIP Analogs 
With A Free Cysteine Residue 

The BRIP protein contains no cysteine residues, 

and therefore contains no residues directly available which 

15 may form a disulfide linkage to antibodies or other 
proteins. Analogs of recombinant BRIP were generated which 
contain a free cysteine residue near the C-terminus of the 
protein. Three residues of the BRIP protein were targets 
for amino acid substitutions. Comparison of the amino acid 

20 sequence of BRIP to the known tertiary structure of the 

ricin A-chain (see FIG. 2) suggested that the three 
positions would be available near the surface of the 
molecule. The three BRIP analogs include cysteines 
substituted in place of serinea??* alanine^^o, and leucineasg 

25 of the native protein, and were designated BRIPc277 (SEQ ID 
NO: 127) , BRIPc27o (SEQ ID NO: 128) and BRIP^^^^ (SEQ ID NO: 
129), respectively. 

(1) A plasmid vector capable of expressing the 
BRIPc277 analog was constructed by replacing the 3 '-end of 

30 the BRIP gene with a DNA segment conferring the amino acid 

change. The JE^coRI fragment containing the BRIP gene from 
pBSl was subcloned into M13mpl8, and single-stranded DNA 
(anti-sense strand) was amplified by PGR with primers OBM2 
(corresponding nucleotides -11 to +8 of the BRIP gene) and 

35 0MB4 (corresponding to amino acids 264-280 of BRIP and the 

termination codon of BRIP, and incorporating the 



wo 94/26910 



PCTAJS94/05348 



-133- 

substitution of a cysteine codon for the native codon for 
serine277 of native BRIP) • The sequences of primers 0BM2 and 
OMB4, wherein the underlined nucleotides encode the 
substituted cysteine, are set out below. 
5 CBM2 (SEQ ID NO: 47) 

5 ' GCATTACATCCATGGCGGC 3' 
0MB4 (SEQ ID NO: 48) 

5« GATATCTCG AGTTAACTATTTCCC AC CACA CG 
CATGGAACAGCTCCAGCGCCTTGGCCACCGTC 3« 
10 ' A fragment containing a BRIP gene in which the codon for 
the amino acid at position 277 was changed to a cysteine 
codon was amplified. The fragment was cloned into the Sjnal 
site of pUC19 (BRL) and the plasmid generated was denoted 
pMB22. pMB22 was digested with EcoRl and an EcoRl-XhoZ 
15 linker (Clonetech, Palo Alto^ CA) was ligated into the 

vector. subsequent digestion with Xhol and religation 
generated vector pINGMB2X. A BamHl to Xhol fragment 
encoding the 3'-end of BRIP with the altered amino acid was 
excised from.pMB2X and the fragment was purified on a 5% 
20 acrylamide gel. This fragment along with an EcdRl to BamHI 

fragment containing the peJB leader sequence and sequences 
encoding the first 256 amino acids of BRIP were substituted 
in a three piece ligation into pING33 22 cut with EcoRl and 
Xhol. The resulting vector containing the BRIPc277 analog 
25 was designated pING3803 (ATCC Accession No. 68722) . 

(2) A BRIP analog with a free cysteine at 
position 256 was constructed using PGR to introduce the 
amino acid substitution. A portion of the expression 
plasmid pING3322 was amplified with primers BRlP-256 and 
30 HINDlll-2. The sequence of each primer is set out below. 

BRIP-256 (SEQ ID NO: 49) 
5« TGT CTGTTCGTGGAGGTGCCG 3* 
HJWDIII-2 (SEQ ID NO: 44) 
5t CGTTAGCAATTTJVACTGTGAT 3» 
35 Nucleotides 4-21 of primer BRIP-256 encode amino acids 256- 

262 of BRIP while the underlined nucleotides specify the 
cysteine to be substituted for the leucine at the 
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corresponding position of the native BRIP protein. Primer 
HINDJlI-2 corresponds . to a portion of the plasmid. The PCR 
product, which encodes the carboxyl terminal portion of the 
BRIP analog, was treated with T4 polymerase, cut with Xhol, 
5 and the resulting fragment was purified on a 5% acrylamide 
gel. Concurrently, plasmid pING3322 was cut with BamHI, 
treated with T4 polymerase, cut with J5coRI^ and the 
fragment containing the pelB leader sequence and sequences 
encoding the first 256 amino acids of BRIP was purified. 

10 "The two fragments were then assembled back into pING3322 to 
generate the gene encoding the analog BRIPcass. This plasmid 
is denoted pING3801. 

(3) A BRIP analog with a cysteine at position 
270 was also generated using PCR. A portion of the 

15 expression plasmid pING3322 was amplified with primers 

BRIP-270 and the HINDlll-2 primer (SEQ ID NO: 44). The 
sequence of primer BRIP-270 is set out below. 

BRIP-270 (SEQ ID NO: 50) 

5 • CCAAGTGTCTGGAGCTGTTCCATGCGA 3 ' 

20 Primer BRIP-270 corresponds to amino acids 268-276 of BRIP 
with the exception of residue 27 0. The codon of the primer 
corresponding to position 27 0 specifies a cysteine instead 
of the alanine present in the corresponding position in 
native BRIP. The PCR product was treated with T4 

25 polymerase, cut with Xhol, and the 51 bp fragment, which 

encodes the carboxyl terminal portion of the analog, was 
purified on a 5% acrylamide gel. The fragment 

(corresponding to amino acids 268-276 of BRIPc27o) was cloned 
in a three piece ligation along with the internal 151 bp 

3 0 BRIP restriction fragment from 5st II to WscI (corresponding 

to BRIP amino acids 217-267) from plasmid pING3322, and 
restriction fragment from Sstll to Xhol from pING3 322 
containing the remainder of the BRIP gene. The plasmid 
generated contains the gene encoding the BRIP^jyo analog and 

35 is designated pING3802. 
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E. Purification Of Recombinant 
BRIP And The BRIP Analogs 

ResGiDbinant— BRIP — (i^BR-I-P) —and -the -BRIP - analogs 

with free cysteine residues were purified essentially as 

5 described for native BRIP except they were prepared, from 

concentrated fermentation broths. For rBRIP, concentrated 

broth from a 10 liter fermentation batch was exchanged into 

10 mM Tris, 20 mM NaCl pH 7.5, loaded onto a Sephacryl S- 

200 column, and eluted with a 20 to 500 mM NaCl linear 

10 ^ gradient. Pooled rBRIP was further purified on a Blue 
Toyopearl® column (TosoHaas) loaded in 20 mM NaCl and 
eluted in a 20 to 50Q mM NaCl gradient in lOmM Tris, pH 
7.5. For BRIP analogs, concentrated* fermentation broths 
were loaded onto a CM52 column (Whatman) in 10 mM phosphate 

15 buffer, pH 7.5, and eluted with a 0 to 0.3M NaCl linear 

gradient. Further purification was by chromatography on a 
Blue Toyopearl® column. 

F. Reticulocyte Lvsate Assay 

The ability of the rBRIP and the BRIP analogs to 
2 0 inhibit protein synthesis in vitro was tested by 

reticulocyte lysate assay as described in Example 1. 
Serial log dilutions of standard toxin (RTA 30), native 
BRIP, rBRIP and BRIP analogs were tested over a range of 1 
/ig/ml to 1 pg/ml. By comparison with an uninhibited 
25 sample, the picomolar concentration of toxin (pM) which 

corresponds to 50% inhibition of protein synthesis (IC50) 
was calculated. The results of the assays are presented 
below in Table 17. 
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Table 17 



Toxin ICcn fpM) 
RTA 30 3-1 

Native BRIP 15 

5 rBRIP 18 

BRIPc256 23 

BRIPc270 20 

BRIPc277 24 



The RLA results indicate thai: the BRIP analogs 
10 exhibit ribosome-inactivating activity comparable to that. 

of the recombinant and native BRIP tox^n. All the analogs 
retained the natural ability of natiVe BRIP to inhibit 
protein synthesis, suggesting that amino acid substitution 
at these positions does not affect protein folding and 
15 activity. 

Construction Of BRIP Iironunoconiuaates 

Immunoconjugates of native BRIP (SEQ ID NO: 3) 
with 4A2 (described in Morishima et al., J". Immunol., 

20 129:1091 (1982) and H65 antibody (obtained from hybridoma 
ATCC HB9286) which recognize the T-cell determinants CD7 
. and CD5, respectively, were constructed. Immunoconjugates 
of ricin A-chains (RTAs) with 4A2 and H65 antibody were 
constructed as controls. The H65 antibody and ricin A- 

25 chains as well as the RTA immunoconjugates were prepared 
and purified according to methods described in U.S. Patent 
Application Serial No. 07/306,433 supra and in 
International Publication No. WO 89/06968. 

To prepare immunoconjugates of native BRIP, both 

30 the antibody (4A2 or H65) and native BRIP were chemically 

modified with the hindered linker 5-methyl-2-iminothiolane 
(M2IT) at lysine residues to introduce a reactive 
sulfhydryl group as described in Goff et al., Bioconjugate 
Chem., 1:381-386 (1990). BRIP (3 mg/ml) was first 

35 incubated with 0.5 mM M21T and 1 mM DTNB in 25 mM 
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triethanolamine, 150 mM NaCl, pH 8.0, for 3 hours at 25 *C. 
The derivitized BRIPr- (M2IT) -S-S-TNB vas then desalted on a 
column of Sephadex GF-D5LS and the number of thiol groups 
introduced was guantitated by the addition of 0.1 inM DTT. 
5 On average, each BRIP molecule contained 0.7 SH/mol. 

4A2 or H65 antibody (4 mg/ml) in triethanolamine 
buffer was similarly incubated with M2IT (0.3 mM) and DTNB 
(1 mM) for 3 hours at 25 *C. Tlntibody- (M21T) -S-S-TNB was 
then desalted and the TNB: antibody ratio was determined* 

10 : To prepare the conjugate, the BRIP- (M2IT) -S-S-TNB was first 
reduced to BRIP-(M2IT)-SH by treatment with 0.5 mM DTT for 
1 hour at 25 *C, desalted by gel filtration of Sephadex® cr- 
05LS to remove the reducing agent, •and then mixed with 
antibody-- (M2IT)-S-S-TNB. 

15 Following a 3 hour incubation at 25 'C, and an 

additional 18 hours at 4*C, the conjugate was purified by 
sequential chromatography on AcA44 (IBF) and Blue 
Toyopearl®. Samples of the final product were run on 5% 
non-reducing SDS PAGE, Coomassie stained, and scanned with 

20 a Shimadzu laser densitometer to quant itate the number of 

toxins per antibody. 

The BRIP analogs containing a free cysteine were 
also conjugated to 4A2 and H65 antibodies. The analogs 
were treated with 50 mM DTT either for 2 hours at 25 *C or 

25 for 18 hours at 4*C to ex-pose the reactive sulfhydryl group 
of the cysteine and desalted. The presence of a free 
sulfhydryl was verified by reaction with DTNB [Ellman et 
al.. Arch. Biochem. Biophys, 82:10-77 (1959)]. 4A2 or H65 
antibody derivatized as described above with M2IT was 

30 incubated with the reduced BRIP analogs at a ratio of 1:5 

at room temperature for 3 hours and then overnight at 4*C. 
Immunoconjugates H65-BRIPc256f 4A2-BRIPc256 f H65-BRIPc277 were 
prepared in 25 mM triethanolamine, 150 mM NaCl pH 8, while 
immunoconjugates HS5-BRIPc270f 4A2-BRIPc27o and 4A2-BRIPc277 

35 were prepared in 0.1 M sodixim phosphate, 150 mM NaCl pH 
7-5. Following conjugation, 10 /xM mercaptoethylamine was 
added for 15 minutes at 25 'C to quenched any unreacted m2IT 
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linkers on the antibody. The quenched reaction solution 
was promptly loaded onto a gel filtration column (AcA44) to 

— ipemeve ^uneon^ugated Mrbosome-inactivating — protein. 



Purification was completed using soft gel affinity 
5 chromatography on Blue Toyopearl® resin using a method 
similar to Knowles et a J., AnaJyt. Biochem.^ 150:440 
(1987). Samples of the final product were run on 5% non- 
reduced SDS PAGE, Coomassie stained, and scanned with a 
Shimadzu laser densitometer to guantitate the number of 

10 toxins per antibody • The conjugation efficiency was 
substantially greater for BRIPc277 ( 78%) than for either of 
the other two analogs, BRIPc27o and BRIPca«6 (each of these was 
about 10%) • Additionally, the BRIPC277 product was a 
polycon jugate, i.e., several BRIP molecules conjugated to 

15 a single antibody, in contrast to the BRIPc27o and BRIPc256 

products which were monoconjugates. 



Example 24 

Properties Of BRIP Immunoconiuaates 
A. Whole Cell Kill Assay 

20 Immunoconjugates of native BRIP and of the BRIP 

analogs were tested for the ability to inhibit protein 
synthesis in HSB2 cells by the whole cell kill assay 
described in Example 1. Standard immunoconjugates H65--RTA 
(H65 derivatized with SPDP linked to RTA) and 4MRTA (4A2 

25 antibody derivatized with M2IT linked to RTA) and BRIP 
immunocon jugate samples were diluted with RPMI without 
leucine at half -log concentrations ranging from 2000 to 
0.632 ng/ml. All dilutions were added in triplicate to 
microtiter plates containing 1 x 10^ HSB2 cells. HSB2 

30 plates were incubated for 20 hours at 37 *C and then pulsed 
with ^H-Leu for 4 hours before harvesting. Samples were 
counted on the Inotec Trace 96 cascade ionization counter. 
By comparison with an untreated sample, the picomolar toxin 
concentration (pM T) of immunoconjugate which resulted in 

35 a 50% inhibition of protein synthesis (I C50) was calculated. 
The assay results are presented below in Table IB. 
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Table IB 

Conjugate IC^ n ft>M T) 

4A2-BRIP 122 ills 

4A2"BRIPc27o 4 6.3 

5 4A2"BRIPc277 57 .5 

4A2-BRIPc256 m6 
H65-BRIP >5000 
H65-BRIPc277 1176 

The BRIP analog conjugates were less potent than 
10 the ricin conjugate control (data not shown) • The 
immunotoxins containing antibody 4A2 and either the BRIPc27o 
or the BRIPc277 analog exhibited comparable to increased 
specific cytotoxicity toward target cells as compared to 
inununotoxin containing native BRIP. While 4A2-BRlPc256 is 
15 less active than. 4A2-BRIP, 4A2-BRIPc27o and 4A2-BRIPc277 are 

between 3 and 4 times more active* Similarly, the 
immunocon jugate of H65 to BRIPC277 shows greater toxicity 
toward target cells than the immunocon jugate of H65 to 
native BRIP. Thus, linkage of antibody to BRIP derivatives 
20 which have an available cysteine residue in an appropriate 
location results in immunotoxins with enhanced specific 
toxicity toward target cells relative to conjugates with 
native BRIP. 

B. Disulfide Bond Stability Assav 

25 Immunoconjugates prepared with native BRIP and 

the BRIP analogs were examined by the disulfide bond 
stability assay described in Example 1. Briefly, 
conjugates were incubated with increasing concentrations of 
glutathione for 1 hour at 37 'C arid, after terminating the 

30 reaction with iodoacetamide, the amount of RIP released was 

quantitated by size-exclusion HPLC on a TosoHaas TSK- 
G2000SW column. 

By comparisons with the amount of RIP released by 
high concentrations of 2-mercaptoethanol (to determine 100% 

35 release) , the concentration of glutathione required to 
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release 50% of the RIP (the RC50) was calculated. As shown 
below in Table 19, the conjugates prepared with BRIPc27o or 
BRIPc277 were significantly more stable than either the RTA 



; or those prepared with native BRIP 




Table 19 


Coniuaate 


RCn (TM) 


H65-RTA 




H65-BRIP 


2.8 


H65-BRIPC277 


196.0 


4A2^RTA 


4.4 


4A2-BRIP 


3^3 


42-BRIPc27o 


53.6 


4A2"BRIPc277 


187-0 



These unexpected results suggest that conjugates prepared 
15 with Type I RIP analogs according to the present invention 
may have enhanced stability and efficacy in vivo. 

Example 2 5 

Preparation of Momordin and Analogs Thereof 

Plants of the genus Mdmordica produce a number of 
20 related proteins known as momofdins or momorcharins which 

are Type I RIPs. The gene encoding momordin II was cloned 
from Momordica balsamina seeds. 

A. Preparation Of . Jbalsamina RNA 

Total RNA was prepared from 4 g of balsandna 

25 seeds as described in Ausubel et al., supra. PolyA 

containing RNA was prepared from 1 mg of total RNA by 
chromatography on oligo- (dT) -cellulose. 40 mg of oligo- 
(dT) -cellulose Type 7 (Pharmacia) was added to 0.1 N NaOH 
and poured into a disposable column (Biorad) . The column 

30 was washed with water until the eluate was pH 5.5, and then 
was washed with IX loading buffer (50 mM NaCitrate, 0.5M 
NaCl, 1 mM EDTA, 0.1% SDS, pH 7.0) until the eluate was pH 
7.0. 1 mg of total RNA was suspended in 300 ^1 of water, 
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heated to 65 "C for 5 minutes, and 3 00 /il of 2X loading 
buffer was added (100 mM Na Citrate, IM NaCl, 2 mM EDTA, 
and 0.2% SDsj . The RNA was loaded onto" the column, and the 
flow through was reheated to 65 'C, cooled to room 
5 temperature, and reloaded onto the column. Column-bound 
mJ^A was washed 5 times with 0.5 ml of IX loading buffer, 
and two times with 0.5 ml of 0.05M NaCitrate, 0.1 M NaCl, 
1 mM EDTA, 0.1% SDS. PolyA- containing RNA was eluted two 
times from the column with 0.5 ml of 25 mM NaCitrate, 1 mM 
10 = EDTA, and 0.05% SDS. 

B. Library Preparation 

A cDNA library from the *polyA-containing W. 
balsamina RNA was prepared in a bacterial expression 
plasmid with the Superscript Plasmid System . (BRL, 
15 Gaithersburg, Maryland) . The cDNA was synthesized from 2 

fig of poly A-containing RNA^ size fractionated, digested 
with Nott, and ligated into the expression vector pSPORT as 
recommended by the manufacturer of the vector, BRL. 



C. Cloning Of The Momordin II Gene 

20 A DNA fragment encoding the first 27 amino acids 

of momordin II was amplified from W. balsamina cDNA by PCR. 

First strand cDNA was prepared from 100 ng of polyA 
containing RNA with an RNA-PCR Kit (Perkin Elmer Cetus) . 
Two partially degenerate primers were synthesized based on 

25 the amino acid sequence of the first 27 amino acids of 

momordin II described in Li et al.^ Experientia, 36:524-527 
(1980) . Because the amino acid sequence of amino acids 
27 of momordin II is 52% homologous to amino acids 1-17 of 
momordin I [Ho et al., BBA/ 1088:311-314 (1991)], some 

30 codon assignments in the degenerate primers were based on 

homology to the corresponding amino acid as well as codon 
preference in the momordin I gene. The sequences of 
primers momo-3 and momo-4 are set out below using lUPAC 
nucleotide symbols. 
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ino3DO-3 (SEQ ID NO: 51) 

5' GATCTTAAYTTYGAYTTCTCNACDGCTAC 3' 
iDOino-4 (SEQ ID NO: 52) 

5 » ATTGGNAGDGTAGCCCTRAARTCYTCDAT 3 • 
5 The resulting 81 bp PCR product was purified on a 5% 
acrylamide gel and cloned into the Smal site of pUCls. 
Three candidate clones were sequenced, and one clone, 
pMOllO, was identified which encoded the N-terminal 27 
amino acids of momordin II. 

^0 ' A hybridization probe was designed for screening 

of the momordin II cDNA library based on the sequence of 
the pMOllO moiDordin II DNA fragment. The sequence of the 
primer momo-5 is shown below. 

momo-5 (SEQ ID NO: 53) 

^5 5» GCCA CTGCAT^AAACCTACACAAAATTTA TTOH 31 

Primer momo-5 corresponds to amino acids 9-18 of mature 
momordin II ♦ The underlined nucleotides of the primer were 
expected to match the DNA sequence of the momordin II gene 
exactly. Since this sequence is highly A/T-rich and may 

20 hybridize to the momordin II gene weakly, the additional 

adjacent nucleotides were included in the primer. Bases 3 
and 30 Coverlined). were in the "wobble" position (i.e.; the 
third nucleotide in a codon) of amino acids 9 (alanine) and 
18 (isoleucine) , respectively, of momordin II and may not 

25 be identical to the nucleotide bases in the native gene. 

A 90,000 member cDNA library in pSPORT was 
screened with ^^P-kinased momo-5, and eight potential 
candidate clones were identified. One clone, pING3619, was 
sequenced and contains an open reading frame corresponding 

30 in part to the expected N-terminal 27 residues of Momordin 

II. The complete momordin gene contains 286 amino acids, 
the first 23 of which are a presumed leader signal (mature 
momordin II is 263 residues) . The DNA sequence of the 
momordin II gene is set out in SEQ ID NO: 13. 
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D. Construction Of An Expression 

Vector Containing The Momordin II Gene 



A-baetejrdaO-expiress-i-en-veetorHfor-th n 

gene was constructed. Two PCR primers were synthesized, 
5 one (iDOiDO-9) which primes from the +1 residue of the mature 

momordin II amino acid sequence, and one at the C-terminus 
(momo-lO) of momordin II which- introduces an Xhol 
restriction site: 

momo-9 (SEQ ID NO: 54) 
10 5' GATGTTAACTTCGATTTGTCGA 3» 

momo-10 (SEQ ID NO: 55) 

5' TCAACTCGAGGTACTCAATTCACAACAGATTCC 3» 
pING36l9 was amplified with momo-9 and momo-io, and the 
product was treated with T4 polymerase,' cut with Xhol, and 

15 purified on an agarose gel. This gene fragment was ligated 

along with the 131 bp peJB leader fragment from pIClOO 
which has been generated by Sstl digestion^ T4 -polymerase 
treatment, and EcoRl digestion, into the araB expression 
vector cleaved with JPcoRI and XhoJ. The product of this 

20 three piece ligation was sequenced to verify that the peJB 

junction and momordin II coding sequence were correct, 
Arabinose induction of cells containing the momordin II 
expression plasmid pING3 621 results- in production of 
momordin II in E. coli, 

25 E. Analogs Of Mormordin II 

Mormordin II has no natural cysteines available 
for conjugation to antibody. Analogs of momordin which 
have a free cysteine for conjugation to an antibody may be 
constructed. Positions likely to be appropriate for 

30 substitution of a cysteine residue may be identified from 

Figure 3 as positions near the ricin A-chain cysteinejsj and 
as positions including the last 26 amino acids of momordin 
II that are accessible to solvent. For example, the 
arginine at position 242 of momordin II aligns with the 

35 ricin A-chain cysteine at position 259 and is a preferred 

target for substitution. Additional preferred substitution 
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positions for momordin II include the serine at position 
241 and the alanine at position 243. 

While the present invention has been described in 
terms of preferred embodiments, it is understood that 
5 variations and improvements will occur to those skilled in 

the art. Therefore, it is intended that the appended 
claims cover all such equivalent variations which come 
within the scope of the invention as claimed. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: Xoma Corporation 



(ii) TITLE OF INVENTION: ImmunotoxinB Comprising Ribsome-Inactivating 
Proteins 

(iii) NUMBER OF SEQUENCES: 144 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HarehaXl, O'Toole, Gerstein, Murray & Borun 

(B) STREET; 6300 Sears Tower, 233 South Wacker Drive 
{C> CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60606-6402 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: floppy dxsk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/064,691 

(B) FILING DATE: 12~MAY^1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/988,430 

(B) FILING DATE: 09-DEC-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT US92/09487 

(B) FILING DATE: 04-NOV-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/901,707 

(B) FILING DATE: 19-JUN-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/787,567 

(B) FILING DATE: 04-NOV-1991 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Noland, Greta £. 

(B) REGISTRATION NUMBER; 35,302 

(C) REFERENCE/DOCKET NUMBER: 32088 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312/474-6300 

(B) TELEFAX: 312/474-0448 

(C) TELEX: 25-3856 

(2) INFORMATION FOR SEQ ID NO:i: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

lie Phe Pro Lye Gin Tyr Pro He He Asn Phe Thr Thr Ala Gly Ala 
15 10 15 

Thr Val Gin Ser Tyr Thr Aen Phe He Arg Ala Val Arg Gly Arg Leu 
20 25 30 

Thr Thr Gly Ala Asp Val Arg Hie Glu He Pro Val Leu Pro Asn Arg 
35 40 45 

Val Gly Leu Pro He Aen Gin Arg Phe He Leu Val Glu Leu Ser Aen 
50 55 60 

His Ala Glu Leu Ser Val Thr Leu Ala Leu Aep Val Thr Asn Ala Tyr 
65 70 75 80 

Val val Gly Tyr Arg Ala Gly Aen Ser Ala Tyr Phe Phe His Pro Asp 

85 90 95 

Aan Gin Glu Asp Ala Glu Ala He Thr His Leu Phe Thr Asp Val Gin 
100 105 110 

Aen Arg Tyr Thr Phe Ala Phe Gly Gly Asn Tyr Aep Arg Leu Glu Gin 
115 120 125 

Leu Ala Gly Aen Leu Arg Glu Aen He Glu Leu Gly Aen Gly Pro Leu 
130 135 140 

Glu Glu Ala He Ser Ala Leu Tyr Tyr Tyr Ser Thr Gly Gly Thr Gin 
145 150 155 160 

Leu Pro Thr Leu Ala Arg Ser Phe He He Cye He Gin Met He Ser 

165 170 175 

Glu Ala Ala Arg Phe Gin Tyr He Glu Gly Glu' Met Arg Thr Arg He 
180 185 ■ 190 

Arg Tyr Aen Arg Arg Ser Ala Pro Asp Pro Ser Val He Thr Leu Glu 
195 200 205 

Aen Ser Trp Gly Arg Leu Ser Thr Ala He Gin Glu Ser Asn Gin Gly 
210 215 220 

Ala Phe Ala Ser Pro He Gin Leu Gin Arg Arg Aen Gly Ser Lye Phe 
225 230 235 240 

Ser Val Tyr Asp Val Ser He Leu He Pro He He Ala Leu* Met Val 

245 250 255 

Tyr Arg Cys Ala Pro Pro Pro Ser Ser Gin Phe 
260 265 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acide 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
1 5 4-0 .1-5 — 

Tyr Val Aon Phe Leu Asn Glu Leu Axg Val Lys Leu Lys Pro Glu Glv 
20 25 30 

Asn Ser Hie Gly He Pro Leu Leu Arg Lys Lye Cys Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn hrg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu Hie Phe Cly Gly Ser 
100 105 . ^ 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Me^ Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 175 

He Arg Asn Aen Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID N0:3t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 280 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
15 10 15 
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Phe Aen Val Gin Ala Ser Ser . Ala Aep Tyr Ala Thr Phe. lie Ala Gly 
20 25 30 

He Arg Aon Lys Leu Arg. AGn Pro Ala Hie Phe Ser His Aen Arg Pro 
-35 40 45 

Val Leu Pro Pro Val Glu Pro Aen Val Pro Pro Ser Arg Trp Phe Hie 
50 55 60 

Val Val Leu Lye Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala He 
65 70 75 80 

Arg Ala Aep Asn He Tyr Leu Glu Gly Phe Lya Ser Ser Aep Gly Thr 

85 90 95 

Trp Trp Glu Leu Thr Pro Gly Leu He Pro Gly Ala Thr Tyr Val Gly 
100 105 110 

Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lye Leu Thr 
115 120 125 

Asn Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 

His Gly Arg Thr Lys Ala Asp Lys Ala Ser Gly Pro Lye Gin Gin Gin 
145 150 155 160 

Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Asn Glu Ala Thr 

165 170 175 

Arg Phe Gin Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 

Ala Val Glu Lys Lys Ser Gly Lys He Gly Asn Glu Met Lys Ala Gin 
195 200 205 

Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val 
210 215 220 

Lys Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro He Glu Lys 
225 230 235 240 

Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Leu 

245 250 255 

Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu 
260 265 270 

Leu Phe His Ala Ser Gly Gly Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4s 

Asp Val Asn Phe Asp Leu Ser Thr Ala Thr Ala Lys Thr Tyr Thr Lys 
1 5 10 15 
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Phe He Glu Asp Phe Arg Ala Thr Leu Pro Phe Ser His Lye Val Tyr 
20 25 30 

^ABp- 1 1 e-Pr o-I>eu- -Leu-Tyr-Ser -Thr — rie-Ser— Aep-Ser^Ar g Arg Phe He 

35 40 45 

Leu Leu hep Leu Thr Ser Tyr Ala Tyr Glu Thr He Ser Val Ala He 
50 55 60 

Abp Val Thr Asn Val Tyr Val Val Ala Tyr Arg Thr Arg Asp Val Ser 
65 70 75 80 

Tyr Phe Phe Lys Glu Ser Pro Pro Glu Ala Tyr Asn He Leu Phe Lye 

85 90 95 

Gly Thr Arg Lya He Thr Leu Pro Tyr Thr Gly Asn Tyr Glu Asn Leu 
' 100 105 110 

Gin Thr Ala Ala Hie Lye He Arg Glu Asn He Asp Leu Gly Leu Pro 
115 120 125 

Ala Leu Ser Ser Ala He Thr Thr Leu Phe Tyt Tyr Aen Ala Gin Ser 
130 135 140 

Ala Pro Ser Ala Leu Leu Val Leu He Gin Thr Thr Ala Glu Ala Ala 
145 150 155 160 

Arg Phe Lye Tyr He Glu Arg Hia Val Ala Lys Tyr Val Ala Thr Asn 

165 170 . 175 

Phe Lys Pro Aen Leu Ala He He Ser Leu Glu Asn Gin Trp Ser Ala 
180 185 190 

Leu Ser Lye Gin He Phe Leu Ala Gin Asn Gin Gly Gly Lys Phe Ara 
195 200 205 

Asn Pro Val Asp Leu He Lys Pro Thr Gly Glu Arg Phe Gin Val Thr 
210 215 220 

Asn Val Asp Ser Asp Val Val* Lye Gly Asn He Lys Leu Leu Leu Asn 
225 230 235 240 

Ser Arg Ala Ser Thr Ala Asp Glu Asn Phe He Thr Thr Met Thr Leu 

245 250 255 

Leu Gly Glu Ser Val Val Asn 
260 . 

(2) INFOKMATZON FOR SEQ ID HO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Val Arg Phe Ser Leu Ser Gly Ser Ser Ser Thr Ser Tyr Ser Lve 
1 5 10 15 

Phe He Gly Asp Leu Arg Lys Ala Leu Pro Ser Asn Gly Thr Val Tyr 
20 25 30 
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Aen Leu Thr He Leu Leu Ser Ser Ala Ser Gly Ala Ser Arg Tyr Thr 
35 40 45 

Leu M et Thr L eu Ser Aen Tyr Asp Gly Lye Ala He Thr Val Ala Val 

50 55~ -60 — 

Asp Val Ser Gin Leu Tyr He Met Gly Tyr Leu Val Aen Ser Thr Ser 
65 70 75 80 

Tyr Phe Phe Aen Glu Ser Aep Ala Lye Leu Ala Ser Gin Tyr Val Phe 

85 90 • 95 

Lye Gly Ser Thr He Val Thr Leu Pro Tyr Ser Gly Aen Tyr Glu Lye 
100 105 110 

Leu Gin Thr Ala Ala Gly Lye He Arg Glu Lys He Pro Leu Gly Phe 
115 120 125 

Pro Ala Leu Aep Ser Ala Leu Thr Thr He Phe His Tyr Asp Ser Thr 
130 135 140 

Ala Ala Ala Ala Ala Phe Leu Val He Leu Gin Thr Thr Ala Glu Ala 
145 150 155 160 

Ser Arg Phe Lye Tyr He Glu Gly Gin He He Glu Arg He Ser Lye 

165 170 175 

ABn Gin Val Pro Ser Leu Ala Thr He Ser Leu Glu Asn Ser Leu Trp 
180 185 190 

Ser Ala Leu Ser Lys Gin He Gin Leu Ala Gin Thr Aen Aen Gly Thr 
195 200 205 

Phe Lye Thr Pro Val Val He Thr Asp Asp Lye Gly Gin Arg Val Glu 
210 215 220 

He Thr Aen Val Thr Ser Lys Val Val Thr Lys Asn He Gin Leu Leu 
225 . 230 235 240 

Leu Asn Tyr Lye Gin Asn Val Ala 

245 

(2) INFORMATION FOR SEQ ID NO? 6? 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Aep Val Ser Phe Arg Leu Ser Gly Ala Thr Ser Ser Ser Tyr Gly Val 
15 10 15 

Phe He Ser Aen Leu Arg Lye Ala Leu Pro Asn Glu Arg Lys Leu Tyr 
20 25 30 

Aep Leu Pro Leu He Arg Ser Ser Leu Pro Gly Ser Gin Arg Tyr Ala 
35 40 45 

He He Hie Leu Thr Aen Tyr Ala Asp Glu Val Ala Leu Asp Val Thr 
50 55 60 
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Aen Val Asp Ala Gly Leu Pro Arg Aen Ala val Leu Tyr lie Met Gly 
65 70 75 80 

Tyr Arg. Ala Gly Aep Thr Ser Tyr Phe Phe Aen Glu Ala Ser Ala Thr 

85 90 95 

Glu Ala Ala Lye Tyr Val Phe Lye Asp Ala Met Arg Lye Val Thr Leu 
100 105 110 

Pro Tyr Ser Gly Aen Tyr Glu Arg Leu Gin Thr Ala Ala Gly Gly Leu 
115 120 125 

Arg Glu Aen lie Pro Leu Gly Leu Pro Ala Leu Asp Ser Ala lie Thr 
130 135 140 

Thr Leu Phe Tyr Tyr Asn Ala Aen Ser Ala Ala Ser Ala Leu Met Val 
145 150 155 160 

Leu lie Gin Ser Thr Ser Glu Ala Ala Arg Tyr Lye Phe He Glu Gin 

165 170 175 

Gin He Gly Ser Arg Val Aep Lya Thr Phe Leu^Pro Ser Leu Ala He 
180 185 190 

He Ser Leu Glu Asn Ser Leu Trp Leu Ala Leu Ser Lye Gin He Gin 
195 200 205 

He Ala Ser Thr Asn Aen Gly Glu Phe Glu Thr Pro Val Val Leu He 
210 215 220 

Aen Ala Gin Asn Gin Arg Val Thr He Thr Aen Val Asp Ala Gly Val 
225 230 235 240 

Val Thr Ser Asn He Ala Leu Leu Leu Aen Arg Aen Asn Met Ala 

245 250 255 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 263 amino acids 
(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp Val Ser Phe Arg Leu Ser Gly Ala Asp Pro Arg Ser Tyr Gly Met 
1 5 10 15 

Phe He Lys Asp Leu Arg Asn Ala Leu Pro Phe Arg Glu Lys Val Tyr 
20 25 30 

Asn He Pro Leu Leu Leu Pro Ser Val Ser Gly Ala Gly Arg Tyr Leu 
35 40 45 

Leu Met His Leu Phe Asn Tyr Asp Gly Lys Thr He Thr Val Ala Val 
50 55 60 

Asp Val Thr Asn Val Tyr He Met Gly Tyr Leu Ala Asp Thr Thr Ser 
65 70 75 80 

Tyr Phe Phe Asn Glu Pro Ala Ala Glu Leu Ala Ser Gin Tyr Val Phe 

85 90 95 



wo 94/26910 



PCTAJS94/05348 



-152- 

Arg Asp Ala Axg Arg Lys lie Thr Leu- Pro Tyr Ser Gly Aen Tyr Glu 
100 105 110 

Arg Leu Gin lie Ala Ala Gly Lys Pro Arg Glu Lye He Pro He Gly 
115 120 125 

Leu Pro Ala Leu Aap Ser Ala He Ser Thr Leu Leu His Tyr Asp Ser 
130 135 140 

Thr Ala Ala Ala Gly Ala Leu Leu Val Leu He Gin Thr Thr Ala Glu 
145 150 155 160 

Ala Ala Arg Phe Lye Tyr He Glu Gin Gin He Gin Glu Arg Ala Tyr 

165 170 175 

Arg Asp Glu Val Pro Ser Leu Ala Thr He Ser Leu Glu Aen Ser Trp 
180 185 190 

Ser Gly Leu Ser Lye Gin He Gin Leu Ala Gin Gly Aen Asn Gly He 
195 200 205 

Phe Arg Thr Pro He Val Leu Val Aep Aon LystSly Asn Arg Val Gin 
210 215 220 

He Thr Asn Val Thr Ser Lye Val Val Thr Ser Asn He Gin Leu Leu 
225 230 235 240 

Leu Asn Thr Arg Aen He Ala Glu Gly Asp Asn Gly Aep Val Ser Thr 

245 250 255 

Thr His Gly Phe Ser Ser Thr 
260 

(2) INFORMATION FOR SEQ ID NO: 8 J 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
•(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

Ala Pro Thr Leu Glu Thr He Ala Ser Leu Asp Leu Asn Asn Pro Thr 
1 5 10 15 . 

Thr Tyr Leu Ser Phe He Thr Asn He Arg Thr Lys Val Ala Asp Lys 
20 . 25 30 

Thr Glu Gin Cys Thr He Gin Lys He Ser Lys Thr Phe Thr Gin Arg 
35 40 45 

Tyr Ser Tyr He Asp Leu He Val Ser Ser Thr Gin Lys He Thr Leu 
50 55 60 

Ala He Asp Met Ala Asp Leu Tyr Val Leu Gly Tyr Ser Asp He Ala 
65 70 75 80 

Aen Asn Lye Gly Arg Ala Phe Phe Phe Lys Asp Val Thr Glu Ala Val 

85 90 95 

Ala Aen Asn Phe Phe Pro Gly Ala Thr Gly Thr Asn Arg He Lys Leu 
100 105 110 
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Thr Phe Thr Gly Ser Tyr Gly Aep Leu Glu Lyo Aen Gly Gly Leu Arg 
115 120 125 

lAF^B- Aep-ABn— Pro— Leu-G4y— I4-e-Phe-Arg -Xieu— Gl-\2— Asn-Se Va.l Asn 

130 135 140 

lie Tyr Gly Lye Ala Gly Asp Val Lye Lye Gin Ala Lye Phe Phe Leu 
145 150 155 160 

Leu Ala lie Gin Met Val Ser Glu Ala Ala Arg Phe Lys Tyr lie Ser 

165 170 175 

Aep Lye lie Pro Ser Glii Lye Tyr Glu Glu Val Thr Val Asp Glu Tyr 
180 185 190 

Met Thr Ala Leu Glu Aen Aen Trp Ala Lye Leu Ser Thr Ala Val Tyr 
195 200 205 

Aen Ser Lye Pro Ser . Thr Thr Thr Ala Thr Lye Cys Gin Leu Ala Thr 
210 215 220 

Ser Pro Val Thr lie Ser Pro Trp lie Phe Ly» Thr Val Glu Glu lie 

225 230 235. 240 

Lye Leu Val Met Gly Leu Leu Lye Ser Ser 

245 250 

(2) XKFORMATION FOR S£Q ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acide 

(B) TYPE: amino acid 
(D) TOPOLCXSY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

lie Aen Thr lie Thr Phe Aep Ala Gly Aen Ala Thr lie Aen Lye Tyr 
1 5 10 15 

Ala Thr Phe Met Glu Ser Leu Arg Aen Glu Ala Lys Asp Pro Ser Leu 
20 25 30 

Lys Cys Tyr Gly lie Pro Met Leu Pro Aen Thr Aen Ser Thr lie Lyo 
35 40 45 

Tyr Leu Leu Val Lys Leu Gin Gly Ala Ser Leu Lys Thr lie Thr Leu 
50 55 60 

Met Leu Arg Arg Asn Aen Leu Tyr Val Met Gly Tyr Ser Asp Pro Tyr 
65 70 75 80 

Asp Asn Lye Cys Arg Tyr His He Phe Aen Asp He Lys Gly Thr Glu 

85 90 95 

Tyr Ser Asp Val Glu Aen Thr Leu Cye Pro Ser Ser Asn Pro Arg Val 
100 105 110 

Ala Lys Pro He Asn Tyr Aon Gly Leu Tyr Pro Thr Leu Glu Lys Lys 
115 120 125 

Ala Gly Val Thr Ser Arg Aen Glu Val Gin Leu Gly He Gin He Leu 
130 135 140 



1 
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Ser Scr Lys He Gly Lys He Ser Gly G^ln Gly Ser Phe Thr Glu Lye 
145 150 155 160 

He Glu Ala Aop Phe Leu Leu Val Ala He Gin Met Val Ser Glu Ala 

165 170 175 

Ala Arg Phe Lye Tyr He Glu Aan Gin Val Lys Thr Aen Phe Asn Arg 
180 185 190 

ABp Phe Ser Pro Asn Asp Lys Val Leu Asp Leu Glu Glu Asn Trp Gly 
195 200 205 

Lys He Ser Thr Ala He His Asn Ser Lys Asn Gly Ala Leu Pro Lys 
210 215 220 

Pro Leu Glu Leu Lys Aen Ala Asp Gly Thr Lys Trp He Val Leu Arg 
- 225 230 235 240 

Val Asp Glu He Lye Pro. Asp Val Gly Leu Leu Asn Tyr Val Asn Gly 

245 250 255 

Thr Cys Gin Ala Thr 
260 

(2) INFORMATION FOR SEQ ID NOrlOs 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY s linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Thr Ser He Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr 
1 5 10 15 

Ser Ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu 
20 25 30 

Lye Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu 
35 40 45 

Lys Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu 
50 55 60 

Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp 
65 70 75 80 

Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu He Thr Ser 

85 90 95 

Ala Glu Ser Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys 
100 105 110 

Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin 
115 120 125 

He Thr Gin Gly Asp Gin Ser Arg Lys Glu Leu Gly Leu Gly He Asp 
130 135 140 

Leu Leu Ser Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val 
145 150 155 160 
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Lys Asp Glu Ala Arg Phe Leu Leu lie Ala lie Gin Met Thr Ala Glu 

165 170 175 




Aen Lys Phe Aen Ser Glu Aen Lye Val lie Gin Phe Glu Val Aen Trp 
195 200 205 



Lys Lys lie Ser Thr Ala lie Tyr Gly 



210 215 



Asp Ala Lys Aen Gly Val Phe 
220 



Aen Lys Asp Tyr Asp Phe . Gly Phe Gly 



Lys Val Arg Gin Val Lys Aep 
235 240 



225 230 



Leu Gin Met Gly Leu Leu Met Tyr Leu 

245 



Gly Lys Pro Lye Ser Ser Aen 
250 255 



Glu Ala Aen 



(2) INFORMATIOK FOR SEQ ID NO:llr 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 baec pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 11: 



GGGCTAGATA 


CCGTGTCATT 


CTCAACCAAA 


GGTGCCACTT 


ATATTACCTA 


CGTGAATTTC 


60 


TTGAATGAGC 


TACGAGTTAA 


ATTGAAACCC 


GAAGGTAACA 


GCCATGGAAT 


CCCATTGCTG 


120 


CGCAAAAAAT 


GTGATGATCC 


TGGAAA6TGT 


TTCGTTTTGG 


TAGCGCTTTC 


AAATGACAAT 


180 


GGACAGTTGG 


CGGAAATAGC 


TATA6AT6TT 


ACAAGTGTTT 


ATGTGGTGGG 


CTATCAAGTA 


240 


AGAAACAGAT 


CTTACTTCTT 


TAAAGATGCT 


CCAGATGCTG 


CTTACGAAGG 


CCTCTTCAAA 


300 


AACACAATTA 


AAACAAGACT 


TCATTTTGGC 


GGCAGCTATC 


CCTCGCTGGA 


AGGTGAGAA6 


360 


GCATATAGAG 


AGACAACAGA 


CTTGGGCATT 


6AACCATTAA 


GGATTGGCAT 


CAAGAAACTT 


420 


GATGAAAATG 


CGATAGACAA 


TTATAAACCA 


ACGGAGATAG 


CTAGTTCTCT 


ATTGGTTGTT 


480 


ATTCAAATGG 


TGTCTGAAGC 


AGCTCGATTC 


ACCTTTATT6 


AGAACCAAAT 


TAGAAATAAC 


540 


TTTCAACAGA 


GAATTCGCCC 


GGCGAATAAT 


ACAATCAGCC 


TTGAGAATAA 


ATGGGGTAAA 


600 


CTCTC6TTCC 


AGATCCGGAC 


ATCAGGTGCA 


AATGGAATGT 


TTTCGGAGGC 


AGTTGAATTG 


660 


GAACGTGCAA 


ATGGCAAAAA 


ATACTATGTC 


ACCGCAGTTG 


ATCAAGTAAA 


ACCCAATIATA 


720 


GCACTCTTGA 


AGTTCGTCGA 


TAAAGATCCT 


AAAACG AG CC 


TTGCTGCTGA 


ATTGATAATC 


780 


CAGAACTATG 


AGTCATTAGT 


GGGCTTTGAT 


TAG 






813 
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(2) INFORMATION FOR SEQ ID NO: 12;. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 12: 



ATGGCGGCAA 


AGATGCCGAA 


GAACGTGGAC 


AAGCCGCTCT 


TCACCGCGAC 


GTTCAACGTC 


60 


CAG6CCAGCT 


CCGCCGACTA 


CGCCACCTTC 


ATCGCCGGCA 


TCCGCAACAA 


GCTCCGCAAC 


120 


CCGGCGCACT 


TCTCCCACAA 


CCGCCCCGTG 


CTGCCGCCGG 


TCGA6CCCAA 


CGTCCC6CCG 


180 


AGCAGGTGGT 


TCCACGTCGT 


GCTCAAGGCC 


TCGCCGACCA 


GCGCCGGGCT 


CACGCTGGCC 


240 


ATCCGCGCGG 


ACAACATCTA 


CCTGGAGGGC 


TTCAAGAGCA 


GCGACGGCAC 


CTGGTGGGA6 


300 


CTCACCCCGG 


GCCTCATCCC 


CGGCGCCACC 


TACGTCGGGT 


TCGGCGGCAC 


CTACC6C6AC 


360 


CTCCTCGGCG 


ACACCGACAA 


GCTAACCAAC 


GTCGCTCTCG 


GCCGACA6CA 


GCTGGCGGAC 


420 


GCGGTGACCG 


CGCTCCACGG 


GCGCACCAAG 


GCCGACAAGG 


CCTCCGGCCC 


GAAGCA6CA6 


480 


CAGGCGAGGG 


AGGCGGTGAC 


GACGCTGGTC 


CTCATGGTGA 


ACGAGGCCAC 


6CGGTTCCAG 


540 


ACGGTGTCTG 


GGTTCGTGGC 


CGGGTTGCTG 


CACCCCAAGG 


CGGTGGAGAA 


GAAGAGCGGG 


600 


AAGATCGGCA 


ATGAGATGAA 


GGCCCAGGTG 


AACGGGTGGC 


AGGACCTGTC 


CGCG6CGCTG 


660 


CTGAAGACGG 


ACGTGAAGCC 


TCCGCCGGGA 


AAGTCGCCAG 


CGAAGTTCGC 


GCCGATCGAG 


720 


AAGATGGGCG 


TGAGGACGGC 


TGAACAGGCC 


GCCAACACGC 


TGGGGATCCT 


GCTGTTCGTG 


780 


GAGGTGCC66 


GTGGGTTGAC 


GGTGGCCAAG 


GCGCTGGA6C 


TGTTCCATGC 


GAGTGGT66G 


840 


AAAtAG 












846 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13$ 

CGTCCGAAAA TGGTGAAATG CTTACTACTT TCTTTTTT7UV TTATCGCCAT CTTCATTGGT 60 

GTTCCTACTG CCAAAGGCGA TGTTAACTTC GATTTGTCGA CTGCCACTGC AAAAACCTAC 120 

ACAAAATTTA TCGAAGATTT CAGGGCGACT CTTCCATTTA GCCATAAAGT GTATGATATA 180 

CCTCTACTGT ATTCCACTAT TTCCGACTCC AGACGTTTCA TACTCCTCGA TCTTACAAGT 240 

TATGCATATG T^CCATCTC GGTGGCCATA GATGTGACGA ACGTTTAT6T TGTGGCGTAT 300 
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CGCACCCGCG ATGTATCCTA CTTTTTTAAA GAATCTCCTC CTGAAGCTTA TAACATCCTA 360 

TTCAAAGGTA CGCGGAAAAT TACACTGCCA TATACCGGTA ATTATGAAAA TCTTCAAACT 420 

GCTGCACACA AAATAAGAGA GAATATTGAT CTTGGACTCC CTGCCTTGAG TAGTGCCATT 480 

ACCACATTGT TTTATTACAA TGCCCAATCT GCTCCTTCTG CATTGCTTGT ACTAATCCAG 540 

ACGACTGCAG AAGCTGCAAG ATTTAAGTAT ATCGAGCGAC ACGTTGCTAA GTATGTTGCC 600 

ACTAACTTTA AGCCAAATCT AGCCATCATA AGCTTGGAAA ATCAATGGTC TGCTCTCTCC 660 

AACAAATCTT TTTGGCGCAG AATCAAGGAG GAAAATTTAG AAATCCTGTC GACCTTATAA 720 

AACCTACCGG GGAACGGTTT CAAGTAACCA ATGTTGATTC AGATGTTGTA AAAGGTAATA 780 

TCAAACTCCT GCTGAACTCC AGAGCTAGCA CTGCTGATGA AAACTTTATC ACAACCATGA 840 

CTCTACTTGG GGAATCTGTT GTGAATTGAA AGTTTAATAA TCCACCCATA TCGAAATAAG 900 

GCATGTTCAT GAC 913 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base paire 

(B) TYPE: nucleic- acid 

(C) STRANDEDNESS: eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTYAARGAYG CNCCNGAYGC NGCNTAYGAR G6 32 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base paire. 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPES DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTYTTYATRA TRCANTGNCG NCANCTRGTY CA 32 
(2). INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



4 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGNYTNGAYA CNGTNWSNTT YWSNACNAAR GG 
(2) INFORHATXON-FOR SEg-ID-NO: 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



32 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGTCTGAACC CGTAACTTGG TAA 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18s 
CACTCTTCCG TATCTCTCTG T 21 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQX7ENCE DESCRIPTION: SEQ ID NO: 19: 
TCAACCCGGG CTAGATACCG T6TCATTCTC AACCAAA6GT GCCACTTATA TTA 53 
(2) INFORMATION FOR SEQ ID N0:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTTCATTTTG GCGGCACGTA TCC 



23 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(AJ~I>ENGTH:- 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANBEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CTCGAGGCTG CAAGCTTACG TGGGATTTTT xTTTTTTTTT TTTTTT 46 
■{2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 1 
CTCGCTGGAA GGTGAGAA 18 
(2) INFORMATION FOR SEQ ID NO: 23s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLEOJLB TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTCGAGGCTG CAAGCTTACG TGGGA .25 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHJUIACTERISTICS; 

(A) LENGTH: 35 base pairo 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TGATCTCGAG TACTATTTAG GATCTTTATC GACGA 



35 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
6TAAGCAGCA TCTGGAGCAT CT 
(Z) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: . ' 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CATTCAAGAA ATTCACGTAG G 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHTUIACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 
GGCCTGGACA CCGTGAGCTT TAG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: • 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
TCGATTGCGA TCCTAAATAG TACTC 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 29 
TTTAGGATCG CAATCGACGA ACTTCAAG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GTTCGTCTGT AAAGATCCTA AATAGTACTC GA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GGATCTTTAC AGACGAACTT CAAGAGT 
(2) INFORMATION FOR. SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
TCTTGTGCTT CGTCGATAAA GATCC 
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(2) INFORMATION FOR SEQ ID NO: 33:. 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 baee pairs 
(B) -TYPE:— nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATCGACGAA6 CACAAGAGTG CTATTTT 

(2)* INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 32 bdee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
GTAAAACCAT GCATAGCACT CTTGAAGTTC GT 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic, acid 

(C) STRTkNDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
AGTGCTATGC ATGGTTTTAC TT6ATCAACT GC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
AGCACATGTG GTGCCACTTA TATTACCTA 



wo 94/26910 



-163- 

(2) INFORMATION FOR SEQ ID NO: 37; 

(i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH: -33 - base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:. DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 
TAACTGGCAC CACATGT6CT AAA6CTCACG GT6 
f2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
TGACTGTGGA CAGTTGGCGG AAATA 
(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
GCCACTGTCC ACAGTCATTT GAAA6CGCTA CC 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40 
GATGATCCTG GAAAGGCTTT CGTTTTGGTA GCGCTT 
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(2) INFORMATION FOR SEQ ID NO: 41s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: "nucleic iacid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
AAGCCTTTCC AGGATCATCA GCTTTTTTGG GCAGCAATGG G 
{2} INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AAGCCTTTCC AGGATCATCA CAT 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GCGACTCTCT ACTGTTTC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRAYIDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CGTTAGCAAT TTAACTGTGA T 
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(2) INFORMATION FOR SEQ " ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
(A)-tENGTH: -16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
AACAGCTATG ACCATG 16 
i2) INFORMATION FOR SEQ ID NO: 46: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single % 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGAACTCGAG GAAACTACCT ATTTCCCAC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GCATTACATC CATGGCGGC 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GATATCTCGA GTTAACTATT TCCCACCACA CGCATGGAAC AGCTCCAGCG CCTTG6CCAC 
CGTC 



60 
64 
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(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
-( B-)- TYPE nu c 1 e i c— a ci d- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
T6TCT6TTCG TGGAGGTGCC 6 
(2y INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50 
CCAAGTGTCT 6GA6CTGTTC CATGCGA 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
GATGTTTiAYT TYGAYTTGTC NACDGCTAC 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: ; 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
ATT6GNAGDG TAGCCCTRAA RTCYTCDAT 
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(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GCCACTGCAA AAACCTACAC AAAATTTATT GA 
(2) INFORMATION FOR SEQ ID NO: 54 J 

(i) SEQUENCE CHARACTERS SXICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid % 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQtlENCE DESCRIPTION: SEQ ID NO: 54: 
GATGTTAACT TCGATTTGTC GA 
(2) INF0301ATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TCAACTCGAG GTACTCAATT CACAACAGAT TCC 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Cvs His His His Ala Ser Arg Val Ala Arg Met Ala Ser Asp Glu Phe 
/ 5 10 15 
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Pro Ser Met Cye 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Ser Gly Gin Ala Gly Ala Ala Ala Ser Glu Ser Leu Phe He Ser 
1 5 - 10 15 

Aen His Ala Tyr 

20 ^ 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CAGCCATGGA ATCCCATTGC TG 22 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CACATGTAAA ACAAGACTTC ATTTTGGC 28 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
TGAAGTCTTG TTTTAGATGT GTTTTTGAAG AGGCCT 
(2) INFORMATION TOR SEQ ID NO: 61: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

ATGCCATATG CAATTATAAA CCAACGGAGA 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 39 base paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 62 
GGTTTATAAT TGCATATGGC ATTTTCATCA AGTTTCTTG 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
CTTTCAACAA TGCATTCGCC CGGCGAATAA TAC 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
GCGAATGCAT TGTTGAAAGT TATTTCTAAT TTG 
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(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 baee pairs 
—fB)~TYPE: -nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SiSQ ID NO: 65: 
GTTTTGTGAG GCAGTTGAAT TGGAAC 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
TTCAACTGCC TCACAAAACA TTCCATTTGC ACCT 
(2) INFORMATION FOR SEQ ID NO: 67: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AAAAGCTGAT GATCCTGGAA AGTG 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
TCCAGGATCA TCAGCTTTTT TGCGCAGCAA TGGGA 
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(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GACATCCAGA TGACTCAGTC tCCATCTTCC ATGTCTGCAT CTCTGGGAGA CAGAGTCACT 60 

ATCACTTGCC GGGCGAGTCA GGACATTAAT AGCTATTTAA GCTGGTTCCA GCAG7VAACCA 120 

GGGAAATCTC CTAAGACCCT GATCTATCGT GCAAACA6AT TGGTAGATGG GGTCCCATCA 180 

AGGTTCAGTG GCAGTGGATC TGGGACAGAT TATACTCTCA CCATCAGCAG CCTGCAATAT 240 

GAAGATTTTG GAATTTATTA TTGTCAACAG TATGATGAGT CTCCGTGGAC GTTCGGTGGA 300 

GGCACCAAGC TTGAAATCAA A 321 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

CAGATCCAGT TGGTGCAGTC TGGACCTGGC CTGAAGAAGC CTGGAGGGTC CGTCAGAATC 60 

TCCTGCGCAG CTTCTGGGTA TACCTTCACA AACTATGGAA TGAACTGGGT GAAGCAG6CT 120 

CCAGGAAAGG GTTTAAGGTG GATGGGCTGG ATAAACACCC ACACTGGAGA GCCAACATAT 180 

GCTGATGACT TCAAGGGACG 6TTTACCTTC TCTTTGGACA CGTCTAAGAG CACTGCCTAT 240 

TTACAGATCA ACAGCCTCAG AGCCGAGGAC ACGGCTACAT ATTTCTGTAC AAGACG6GGT 300 

TACGACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCC . 354 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GAGATCCAGT TGGTGCAGTC TGGAGGAGGC CTGGTGAAGC CTGGAGGGTC CGTCAGAATC 



60 
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TCCTGCGCAG CTTCTGGGTA TACCTTCACA AACTATGGAA TGAACTGGGT GCGCCAGGCT 120 

CCAGGAAAGG GTTTAGAGTG GATGGGCTGG ATAAACACCC ACACTGGAGA GCCAACATAT 180 

GCTGATTCTf"TCAAGGG GTTTACCTTC TCTTTGGACG ATTCTAAGAA CACTGCCTAT 240 

TTACAGATCA ACAGCCTCAG AGCCGAGGAC ACGGCTGTGT ATTTCTGTAC AAGACGGGGT 300 

TACGACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCC 354 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pair 8 

(B) TYPE: nucleic acid 

(C) STRANPEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GACATCCAGA TGACTCAGTC TCCATCTTCC CTGTCTGCAT CTGTAGGAGA CAGAGTCACT 60 

ATCACTTGCC GGGCGAGTCA GGACATTAAT AGCTATTTAA GCTGGTTCCA GCA6AAACCA 120 

GGGAAAGCTC CTAAGACCCT GATCTATCGT GCAAACAGAT TGGAlATCTGG GGTCCCATCA 180 

AGGTTCAGTG GCAGTGGATC TGGGACAGAT TATACTCTCA CCATCAGCAG CCTGCAATAT 240 

GAAGATTTTG GAATTTATTA TTGTCAACAG TATGATGAGT CTCCGTGGAC GTTCGGTGGA 300 

GGCACCAAGC TTGAAATCAA A 321 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOs73: 
TGTCATCATC ATGCATCGCG AGTTGCCAGA ATGGCATCTiS ATGAGTTTCC TTCTATGTGC 60 
GCAAGTACTC 70 
(2) INiPORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs74: 
TCGAGAGTAC TTGCGCACAT AGAAGGA7UVC TCATCAGATG CCATTCTGGC AACTCGCGAT 60 
GCATGATGAT GACATGCA 78 
(2) INFORMATION FOR SEQ ID NO: 75; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
TGTTCGGCCG CATGTCATCA TCATiiCATCG 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
AGTCATGCCC CGCGC 15 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
TCCCGGCTGT CCTACAGT 18 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TCCAGCCTGT CCA6ATGGTG TGTGAGTTTT GTCACAA 37 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 baoe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:-79: 
CTAACTCGAG AGTACTGTAT GCATGGTTCG AGATGAACAA AGATTCTGAG GCTGCAGCTC 60 
CAGCCTGTCC AGATGG ^ 76 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8D: 
CTAACTCGAG AGTACTGTAT 20 
(2) INFORMATION FOR SEQ ID NO: 81: 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:81: 
TCCAGCCTGT CC AGATGG AC ACTCTCCCCT GTTGAA .36 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI^EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
GTACAGTGCA AGGTGGAT 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
CATGCGGCCG ATTTAGGATC TTTATCGACG A 
(2) INFORMATION FOR SEQ ID NO:84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:84 

AACATCCAGT TGGTGCAGTC TG 

(2) INFORMATION FOR SEQ ID NO; 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85 
GAGGAGACGG TGACCGTGGT 
(2) INFORMATION FOR SEQ ID NO:86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
GACATCAAGA TGACCCAGT 
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(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GTTTGATTTC AAGCTTGGTG C 
{2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

ACTTCGGCC6 CACCATCTGG ACAGGCTGGA G 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



21 



31 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GACATCCAGA TGACTCAGTC TCCATCTTCC CTGTCTGCAT CTGTAGGAGA CAGAGTCACT 60 

ATCACTTGCC GGGCGAGTCA GGACATTAAT AGCTATTTAA GCTGGTTCCA GCAGAAACCA 120 

GGGAAAGCTC CTAAGACCCT GATCTATCGT GCAAACA6AT TGGAATCTG6 GGTCCCATCA 180 

AGGTTCAGTG GCAGTGGATC TGGGACAGAT TATACTCTCA CCATCAGCAG CCTGCAATAT 240 

GAAGATTTTG 6AATTTATTA TTGTCAACAG TATGATGA6T CTCCGTGGAC GTTC6GTGGA 300 

GGCACCAAGC TTGAGATGAA AGGTGGCGGT GGATCTGGTG GAGGTGGGTC C66AGGTGGA 360 

GGATCTGAGA TCCAGTTGGT GCAGTCTGGA GGAGGCCTGG TGAAGCCTGG AGGGTCCGTC 420 

AGAATCTCCT GCGCAGCTTC TGGGTATACC TTCACAAACT ATGGAATGAA CTGGGTGCGC 480 

CAGGCTCCAG GAAAGGGTTT AGAGTGGATG GGCTGGATAA ACACCCACAC TGGAGAGCCA 540 

ACATATGCT6 ATTCTTTCAA GGGACGGTTT ACCTTCTCTT TGGACGATTC TAAGAACACT 600 

GCCTATTTAC AGATCAACAG CCTCAGAGCC GAGGACACGG CTGTGTATTT CTGTACAAGA 660 
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CGGGGTTACG ACTGGTACTT CGATGTCTGG GGCCAAGGGA CCACGGTCAC CGTCTCCTCA 720 
TGA 723 
(2) INFORMATION FOR SEQ ID NO:90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 

GAGATCCAGT TGGTGCAGTC TGGAGGAGGC CTGGTGAAGC CTGGAGGGTC CGTCAGAATC 60 

TCCTGCGCAG CTTCTGGGTA TACCTTCACA AACTATGGAA TGAACTGGGT GCGCCAGGCT" 120 

CCAGGAAAGG GTTTAGAGT6 GATGGGCTGG ATAAACACCC ACACTGGAGA GCCAACATAT 180 

GCTGATTCTT TCAAGGGACG GTTTACCTTC TCTTTGGACG ATTCTAAGAA CACTGCCTAT 240 

TTACAGATCA ACAGCCTCAG AGCCGAGGAC ACGGCTGTGT ATTTCTGTAC AAGACGGGGT 300 

TACGACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCAGGTGGC 360 

GGTGGATCTG GTG6AGGTGG GTCCGGAGGT GGAGGATCT6 ACATCCAGAT GACTCAGTCT 420 

CCATCTTCCC TGTCTGCATC TGTAGGAGAC AGAGTCACTA TCACTTGCCG GGCGAGTCAG 480 

GACATTAATA GCTATTTAAG CTGGTTCCAG CAGAAACCAG GGAAAGCTCC TAAGACCCTG 540 

ATCTATCGTG CAAACAGATT GGAATCTGGG GTCCCATCAA GGTTCAGTGG CAGTGGATCT 600 

GGGACAGATT ATACTCTCAC CATCAGCAGC CTGCAATATG AAGATTTTGG AATTTATTAT 660 

TGTCAACAGT ATGATGAGTC TCCGTGGACG TTCGGTGGAG GCACCAAGCT TGAGATGAAA 720 

TGA 723 
(2) INFORMATION FOR SEQ ID NOs91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
CGGACCCACC TCCACCAGAT CCACCGCCAC CTTTCATCTC AAGCTTGGTG C 51 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 



(xil SEQUENCE DESCRIPTION: 5EQ ID NO; 92: 
GACATCCAGA TGACTCAGT 
(2) INFORMATION FOR SEQ ID NO: 93: 

<i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 49 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: ^ 
GGTGGAGGTG GGTCCGGAGG TGGAGGATCT GAGATCCAGT TGGTGCAGT 
(2) INFORMATION FOR SEQ ID NO: 94s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
TGTACTCGAG CCCATCATGA GGAGACGGTG ACCGT 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
GACATCCAGA TGACTCAGT 
(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCKIPTION: SEQ ID NO:96: 
TGTACTCGAG CCCATCATTT CATCTCAAGC TTGGTGC 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 97: 
GAGATCCAGT TGGTGCAGTC TG 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

CGGACCCACC TCCACCAGAT CCACCGCCAC CTGA6GAGAC 6GTGACCGT 49 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

,(A) LENGTH: 251 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 

Tyr Val Aen Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Glv 
20 25 30 

Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 

Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 
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Gly Leu Phe Lys Aen Thr lie Lye Thr Arg Leu HiD Phe Gly Gly Ser 
100 105 110 

Tyr- Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly lie Glu Pro Leu Arg lie Gly lie Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

lie Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 > 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Xys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 100: 

( i ) SEQXJENCE CHARACTERI STI CS : 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY s linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:100: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 

Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Aen Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 
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Gly lie Glu Pro Leu Arg lie Gly lie Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

lie Asp Asn Tyr Lys Pro Thr Glu lie Ala Ser Ser Leu Leu Val Val 
145 150 155 * 160 

lie Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Asn Gin 

165 170 175 

lie Arg Asn Aen Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr. He 
180 185 190 

Ser Leu Glu Asn Lye Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lye Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lya Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys « 

245 250 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101: 

Gly Leu Asp Thr ' Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
15 10 15 

Tyr Val Asn Phe Leu. Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 

Aen Ser His Gly He Pro Leu Leu Arg Lys Lye Ala Asp Asp Pro Gly 
35 40 45 

Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 . 

Gly Leu Phe Lys Asn Thr He Lye Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



« 
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Ile Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Aen Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Aen Thr He 

.2-80 185 . 190 

Ser Leu Glu Asn Lys Trp Gly Lye Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Aen Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lye Lye Tyr Tyr Val Thr Ala Val Aep Gin Val Lys Pro Lys He 
225 230 235 240 

^ Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: ^ 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102s 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 

Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 
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Ser Leu Glu Aon Lys Trp Gly Lye Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Aen Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lye Lye Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lye Phe Val Cys Lys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 
z (B) TYPE: amino acid 

(D) TOPOLOGy! linear 

(ii) MOLECULK TYPE: protein 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 103: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lye Gly Ala Thr Tyr He Thr 
15 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 

Aon Ser His Gly He Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Aen Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly. Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lye Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro ,Leu Arg He Gly He Lys Lye Leu Asp Glu Asn Ala 
130 135 140 

\ 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Aan Gin 

165 170 175 

He Arg Asn Aen Phe Gin Gin Arg He Hxg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 
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Gly Lye Lye Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lye lie 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Cys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Gly Leu Aap Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
1 5 10 ^ 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lye Leu Lys Pro Glu Gly 
20 25 30 

Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lye Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lye Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He 61u Asn Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Cys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 

245 250 
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(2) INFORMATION FOR SEQ ID NO; 105 J 

(i) SEQUENCE CHARACTERISTICS: 
(-A )— LENGTH 2 51— amino- acid 8 



(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105; 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lye Gly Ala Thr Tyr lie Thr 
1 5 10 15 

- Tyr Val Asn Phe Leu Aen Glu Leu Arg Val Lys Leu Lye Pro Glu Gly 

20 25 30 

Asn Ser His Gly lie Pro Leu Leu Arg Lya Lys Cys Aep Asp Pro Gly 
35 40 45 

Lye Cys Phe Val Leu Val Ala Leu Ser Aen Asp Aan Gly Gin Leu Ala 
50 55 60 

Glu lie Ala lie Aep Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Axg Aen Arg Ser Tyr Phe Phe Lys Aep Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Aen Thr He Lye Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lye Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lye Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lye Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val' Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Cys Phe Val Aep Lys Asp Pro Lye 

245 250 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID-NO:106 

Gly Leu Asp Thr Val Ser Phe Ser Thr Cye Gly Ala Thr Tyr lie Thr 
1 5 10 « 

Tyr Val Asn Phe Leu Aen Glu Leu Arg Val Lye Leu Lya Pro Glu Glv 
20 25 30 

Asn Ser Hie Gly lie Pro Leu Leu TVrg Lye Lys Cye Asp Asp Pro Glv 
35 40 45 

Lye Cye Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lye Aep Ala Pro Aap Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lye Aen Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser ser Leu Leu Val Val 
145 150 155 . 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 175 

He Arg Asn Asn Phe Oln Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Aen Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Aep Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lye 

245 250 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acide 

(B) TYPE: amino acid 
(D) TCPCLOGy: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
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Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
15 10 15 

~Tyr-Val— Aen— Phe-teu— ABii-alu-Leu-Arg— Val— Lys— Leu--LyB--Pro -Glu Gly 
20 25 30 

Aon Ser His Gly lie Pro Leu Leii Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Cys Gly Gin Leu Ala 
50 55 60 

Glu lie Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Aep Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Arg He Urg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 .235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Jii) MOLECtJLE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lye Leu Lys Pro Glu Gly 
20 25 30 
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Asn Ser His Gly He Pro Leu Leu Arg Lye Lys Cys Asp Asp Pro Gly 
35 40 45 

Lye Cys Phe Val Leu Val Ala Leu Ser Aan Aep Aen Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Aep Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Aen Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 - 95 

Gly Leu Phe Lys Asn Thr Cys Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

- Tyr Pro Ser Leu Glu Gly Glu Lye Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 

Gly lie Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 

He Asp Aen Tyr Lye Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 170 ^ 175 

He Arg Asn Aen Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lye Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein . 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 109: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 

Asn Ser His Gly He Pro Leu Leu lirg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 

Lys Cys Phe. Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 
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Glu He Ala He Aep Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

- — Sirg'TiBn Krg"'Seir~Ty2rTph-e— phe Ly s A~sp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lye Asn Thr He Lys Thr Arg Leu Hie Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Aep Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lye Lye Leu Aep Glu Asn Ala 
130 135 140 

He ABp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 

He Gin Met Val ser Glu Ala Ala Arg Phe Thr Phe He Glu Aen Gin 

165 170 175 

He Arg Asn Asn Phe Gin Gin Cys He Arg Pro 'Ala Aen Asn Thr He 
180 185 190 

Ser Leu Glu Asn Lye Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lye 

245 250 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Cys Gly Ala Thr Tyr He Thr 
1 5 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 . 30 

Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 

Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 



wo 94/26910 



PCTAJS94/05348 



^190- 

Gly Leu Phe Lyo hen Thr He Lye Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lyo Ala Tyr Arg Glu Thr Thr Asp. Leu 
115 120 125 

Gly He Glu Pro Leu Arg He Gly He Lys Lye Leu Aep Glu Aen Ala 
130 135 140 

He A8P Asn Tyr Lye Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 ^ 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 

165 .170 175 

He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Abh Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu L^u Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lye Phe Val Asp Lys Asp Pro Lys 

245 250 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 
(B.) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:lll: 

GlY Leu Asp Thr Val Ser Phe Ser Thr Cys Gly Ala Thr Tyr He Thr 
15 10 15 

Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30. 

Aen Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 

Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 

Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 

Arg Aen Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 

85 90 95 

Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 

Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 
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Gly lie Glu Pro Leu Arg He Gly lie Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



I-ie—ABp-ABu— Tyr Lye Pro-rThi?-Glu -lie* Ala- Ser Ser Leu Leu Val .Val 

145 150 155 160 

He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Aen Gin 

165 170 175 

He Arg Aen Asn Phe Gin Gin Arg He Arg Pro Ala Aen Aen Thr He 
180 185 190 

Ser Leu Glu Asn Lys Trp Gly Lye Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 

Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 

Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 

Ala Leu Leu Lys Phe Val Cys Lys Asp Pro Lys % 

245 250 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 29 base pairs 

(B) TYPE: nucleic acid . 

(C) STRJVNDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
TGATGCGGCC GACATCTCAA GCTTGGT6C 29 
(2) INFORMATION FOR SEQ ID NO: 113; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
TGATGCGGCC GACATCTCAA GCTTGGTGC 29 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: 
TCTAGGTCAC CGTCTCCTCA CCATCTGGAC AGGCTGGA 38 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TTCGT^GCTT GAGATGAAAC CATCTGGACA GGCTGGA ^ ^ 37 

(2) INFORMATION FOR SEQ ID NO:116: V 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
GAGATCCAGT TGGT6CA6TC T6 22 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:il7: 
AGTCGTCGAC ACGATGGACA TGA(^ACCCC TGCTCAGTTT CTTGGCATCC TCCTACTCTG 60 
GTTTCCAGGT ATCAAATGTG ACATCCAGAT GACTCAGT 98 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



TCACTTGCCG GGCGAATCAG GACATTAATA GCTATTTAAG CTGGTTCCAG CAGAAACCAG 60 
GGAAAGCTCC TAAGACCCT 80 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TGACTCGCCC GGC7UVGTGAT AGTGACTCT6 TCTCCTAGAG ATGCAGACAG GGAAGATGGA 60 
GACTGAGTCA TCTGGATGTC .80 



<2) INFORMATION FOR SEQ ID NO: 120: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
GATCCACTGC CACTGAACCT TGATGGGACC CCA6ATTCCA ATCTGTTTGC ACGATAGATC 60 
AGGGTCTTAG GAGCTTTCC 79 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 
GGTTCAGTGG CAGTGGATCT GGGACAGATT ATACTCTCAC CATCAGCAGC CTGCAATATG 
AAGATTTTGG AATTTATTAT TG 



60 
82 
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(2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid * 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GTTTGATTTC AAGCTTGGTG CCTCCACCGA ACGTCCACGG AGACTCATCA TACTGTTGAC i 
AATAATATAT TCCAAAATCT TC 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STIUVNDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Asp lie Lys Met Thr Gin Ser Pro ser Ser Met Tyr Ala Ser Leu Gly 
15 10 15 

Glu Arg Val Thr lie Thr Cys Lys Ala Ser Gin Asp lie Asn Ser Tyr 
20 25 30 

Leu Ser Trp Phe His His Lys Pro Gly Lys Ser Pro Lys Thr Leu lie 
35 40 45 

Tyr Arg Ala Asn Arg Leu Val Asp Gly Val Pro Ser Ara Phe Ser Gly 
50 55 60 ' 

Ser Gly Ser Gly Gin Asp Tyr Ser Leu ,Thr lie Ser Ser Leu Asp Tyr 
65 70 75 80 

Glu Asp Met Gly lie Tyr Tyr Cys Gin Gin Tyr Asp Glu Ser Pro Trp 

85 90 , 95 

Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTIONr SEQ ID NO: 124: 

Gin lie Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 

-1 5 — 10 15 - 

Thr Val Lye lie Ser Cys Lye Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 
20 25 30 

Gly Met Aen Trp Val Lys Gin Ala Pro Gly Lys Gly Leu Arg Trp Met 
35 40 45 

Gly Trp lie Asn Thr Hie Thr Gly Glu Pro Thr Tyr Ala Aep Asp Phe 
50 55 60 

Lye Gly Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala Ser Thr Ala Tyr 
65 70 . 75 80 

Leu Gin lie Asn Asn Leu Lys Asn Glu Aep Thr Ala Thr Tyr Phe Cys 

85 90 95 

Thr Arg Arg Gly Tyi Aep Trp Tyr Phe Asp Val Trp Gly Ala Gly Thr 
100 105 ^ 110 

Thr Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

Aep lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
1 5 10 15 

Asp Arg Val Thr He Thr Cys Arg Ala Ser Gin Asp lie Asn Ser Tvr 
20 25 30 

Leu Ser Trp Phe Gin Gin Lys Pro Gly Lys Ala Pro Lys Thr Leu He 
35 40 45 

Tyr Arg Ala Asn TUrg Leu Glu Ser Gly. Val Pro Ser Axq Phe Ser Glv 
50 55 60 

Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr He Ser Ser Leu Gin T^r 
65 70 75 80 

Glu Asp Phe Gly He Tyr Tyr Cys Gin Gin Tyr Aep Glu Ser Piro Trp 

85 90 95 

Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lye 
100 105 
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(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Glu lie Gin Leu Val Gin Ser Gly Gly Gly Leu Val Lys Pro Gly Gly 
. 1 5 10 15 

Sigr Val Arg lie Ser Cys Ala Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 
20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Qiy Leu Glu Trp Met 
35 40 45 

Gly Trp lie Asn Thr His Thr Gly Glu Pro Thr Tyr Ala Asp Ser Phe 
50 55 60 

Lys Gly Arg Phe Thr Phe Ser Leu Asp Asp Ser Lys Asn Thr Ala Tyr 
65 70 75 80 

Leu Gin lie Asn Ser Leu Arg Ala GLlu Asp Thr Ala Val Tyr Phe Cys 

85 90 95 

Thr Arg Arg Gly Tyr Asp Trp Tyr Phe Asp Val Trp Gly Gin Gly Thr 
100 105 110 

Thr Val Thr Val Ser Ser 
115 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
1 5 10 15 

Phe Asn Val Gin Ala Ser Ser Ala Aep Tyr Ala Thr Phe lie Ala Gly 
20 25 30 

lie Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro 
35 40 45 

Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
• 50 55 60 . 

Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala lie 
65 70 75 80 
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Arg Ala Asp Aen He Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 

85 90 95 

Trp Trp Glu Leu Thr Pro Gly Leu He Pro Gly Ala Thr Tyr Val Gly 
100 105 110 

Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lye Leu Thr 
115 120 125 

Aen Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 

Hie Gly Arg Thr Lye Ala Asp Lye Ala Ser Gly Pro Lye Gin Gin Gin 
145 150 155 160 

Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Aen Glu Ala Thr 

165 170 175 

Arg Phe Gin Thr Val Ser .Gly Phe Val Ala Gly Leu Leu Hie Pro Lys 
180 185 190 

Ala Val Glu Lys Lys Ser Gly Lye He Gly Asn^Glu Met Lys Ala Gin 
195 200 205 

Val Aen Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lye Thr Aep Val 
210 215 220 

Lye Pro Pro Pro Gly Lye Ser Pro Ala Lys Phe Ala Pro He Glu Lys 
225 230 235 240 

Met Gly Val Arg Thr Ala Glu Gin Ala Ala Aen Thr Leu Gly He Leu 

245 250 255 

Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lye Ala Leu Glu 
260 265 270 

Leu Phe Hie Ala Cyo Gly Gly Lye 
275 280 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 
(D) TOPOLCX;ys linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 128: 

Ala Ala Lys Met Ala Lye Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
15 10 15 

Phe Asn Val Gin Ala Ser Ser Ala Asp Tyr Ala Thr Phe He Ala Gly 
20 25 30 

He Arg Asn Lys Leu Arg Asn Pro Ala Hie Phe Ser His Asn Arg Pro 
35 40 45 

Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
50 55 60 

Val val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala He 
65 70 75 80 
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Arg Ala Aep Aon He Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 

85 90 95 

Trp Trp Glu Leu Thr Pro Gly Leu He Pro Gly Ala Thr Tyr Val Gly 
100 105 110 

Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Aop Thr Aep Lys Leu Thr 
115 120 125 

Aen Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 

His Gly Arg Thr Lys Ala Asp Lys Ala Ser Gly Pro Lye Gin Gin Gin 
145 150 155 160 

Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Aen Glu Ala Thr 

165 170 175 

Arg Phe Gin Thr VaX Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 

Ala Val Glu Lys Lys Ser Gly Lys He Gly Asn cfu Met Lys Ala Gin 
195 200 205 

Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lye Thr Asp Val 
210 215 220 

Lys Pro Pro -Pro Gly. Lye Ser Pro Ala Lys Phe Ala Pro He Glu Lys 
225 230 235 240 

Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Leu 

245 250 255 

Leu. Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Cys Leu Glu 
260 265 270 

Leu Phe His Ala Ser Gly Gly Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO:129; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acias 
<B) TYPE: amino acid 
(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:129: 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
15 10 15 

Phe Asn Val Gin Ala Ser Ser Ala Asp Tyr Ala Thr Phe He Ala Gly 
20 25 30 

He Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro 
35 40 45 

Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
50 55 60 

Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala He 
65 70 75 80 
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Arg Ala Asp Asn lie Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 

85 90 95 

Trp Trp-GOu -Leu Thr Pro Gly Leu lie Pro Gly Ala Thr Tyr Val Gly 
100 105 110 

Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lys Leu Thr 
115 120 125 

Aen Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 

His Gly Arg Thr Lye Ala Asp Lys Ala Ser Gly Pro Lys Gin Gin Gin 
145 150 155 160 

Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Asn Glu Ala Thr 

165 170 175 

Arg Phe Gin Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 

Ala Val Glu Lys Lye Ser Gly Lys lie Gly Asn iGlu Met Lye Ala Gin 
195 200 205 

Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val 
210 215 220 

Lye Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro lie Glu Lys 
225 230 235 240 

Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Cys 

245 250 255 

Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu 
260 265 270 

Leu Phe His Ala Ser Gly Gly Lys 
275 280 



(2) INFORMATION FOR 5EQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTIONS SEQ ID NO:130i 

Ser Cys Asp Lys Thr His Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO? 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
TGTCGACATC ATGGCTTGGG TGTGGACCTT GCTATTCCTG ATGGCAGCT GCCCAAAGTG 60 
CCCAAGCAGAGATC CAGTTGGTGCA G 82 
(2) INFORHATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STR7VNDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 132: 

AAGGTATACC CAGAAGCTGC GCAGGAGATT CTGACGGACC CTCCAGGCTT CACCAGGCCT 60 

CCTCCAGACT GCACCAACTG GATCTC 86 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
GCAGCTTCTG GGTATACCTT CACAAACTAT 6GAATGAACT G6GTGC6CCA GGCTCCAG6A 60 
AAGAATTTA6 AGTGGATGGG CTGG .84 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

AAAGAGAAGG TAAACCGTCC CTTGAAAGAA TCAGCATATG TTGGCTCTCC AGTGTGGGTG 
TTTATCCAGC CCATCCACTC TAAAC 



60 
85 
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(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 
^fA) "LENGTH: - 87 baBe-pairB 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
GACGGTTTAC CTTCTCTTTG GACGATTCTA AGAACACTGC CTATTTACAG ATCAACAGCC 60 
TCAGAGCCGA GGACACGGCT GTGTATT 87 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs • 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: eingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

GAGGAGACGG TGACCGTGGT CCCTTGGCCC CAGACATCGA AGTACCAGTC GTAACCCCGT 60 
CTTGTACAGA AATACACAGC CGTGTCCTCG GC . 92 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
GCAGCTTCTG GGTATACCTT CACAAACTAT GGAATGAACT GGGTGAAGCA GGCTCCAGGA 60 
AAGG6TTTAA GGTGGATGGG CTGG 84 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



AAAGAGAAGG TAAACCGTCC CTTGAAGTCA TCAGCATATG TTGGCTCTCC A6TGTG6GTG 



60 



TTTATCCAGC CCATCCACCT TAAAC 



85 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
GACGGTTTAC CTTCTCTTTG GACACGTCTA AGTGCACTGC CTATTTACAG ATCAACAGCC 60 



(2) INFORMATION FOR SEQ ID NO:140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOs140j 
AGGAGACGGT GACCGTGGTC CCTTGGCCCC AGACATCGAA GTACCAGTCG TAACCCCGTC 60 
TTGTACAGAA ATATGTAGCC GTGTCCTCGG C . 91 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHAP-ACTERISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:141: 

Lys Pro Ala Lye Phe Phe Arg Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



TCAGAGCCGA GGACACGGCT ACAT 



84 
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(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 142: 

Lye Pro Ala Lys Phe Leu Arg Leu 

-i- 5 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
GGCCGCAAAG CCGGCTTIAGT TCTTMCGTCT GAGT 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
ACTCAGACGK AAGAACTTAG CCGGCTTTGC 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule I3bis) 

A. The indications made below reJaie lo ihe microoTganism referred lo in ibe description 

on page , line . , See attached sheet 

B. IDENTIFICATION OF DEPOSIT Further deposits are idenlified on an additional sheet 
Name of depositary insiituiion 

American Type Culture Collection 

Address of depositary institution (including postal code and country) 

. 12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit 


Accession Number 


See attached sheet 


See attached sheet 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This infomjalion Is continued on an additional sheet Q 



"In respect of those designations in which a European patent is sought > 
a sample of the deposited microorganism will be made available until the 
publication of the mention of the grant of the European patent or until the 
date on which the application has been refused or withdrawn or is deemed to 
be withdrawn, only by the issue of such a sample to an expert nominated by 
the person requesting the sample. (Rule 23(A) EPC)." 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all tUsignased States) 
EPO 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the Inlemaiiona 1 Bureau laier (specify the general nature of the indteotiora e.g,, 'Accession 
Number of Deposit') 



For receiving Office use only 




For International Bureau use only 



I I This sheet was received by the International Bureau on: 



Authorized officer 
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CLAIMS 

1. A fusion protein comprising gelonih fused to 
the amino or carboxy terminal end of an antigen-binding portion 
of he3 antibody. 

2. The fusion protein of claim 1 wherein said 
fusion protein lacks a cleavable peptide segment linking said 
gelonin and said antibody portion. 



3. The fusion protein of claim 1 which is VlVb- 



Gel. 



4. The fusion protein of claim 1 which is VhVl- 



Gel- 



5. The fusion protein of claim 1 which is Gel- 



6. The fusion protein of claim 1 which is Gel- 



VlVh. 



7. The fusion protein of claim 1 which is 

Gel: : kappa, Fd. 

8. The fusion protein of claim 1 which is 

Gel: :Fd, kappa. 

9. A fusion protein comprising gelonin fused to 
the amino or carboxy terminal end of an antigen-binding portion 
of an antibody, said fusion protein further comprising the 
peptide segment CCL between said gelonin and said antibody 
portion 4 
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10* A fusion protein comprising gelonin fused to 
the amino or carboxy terminal end of an antigen-binding portion 
of an antibody, said fusion protein further comprising the 
peptide segment CCF between said gelonin and said antibody 
portion. 

11. The fusion protein (Gel: :RMA: :kappa,Fd' )2- 

12. The fusion protein (Gel: :RMA: :Fd' , kappa)2. 

13. The fusion protein Gelonin: :RMA: : kappa, Fd. 

14. The fusion protein Gelonin: :RMA: :Fd, kappa. 

15. The fusion protein Gelonin: :SLT: : kappa, Fd. 

16. The fusion protein Gelonin: :SLT: :Fd, kappa. 

17. The fusion protein VaV^: : SLT: : Gelonin. 

18. The fusion protein VlVh: : SLT: : Gelonin. 

19. The fusion protein VhVl: :RMA: : Gelonin. 

20. The fusion protein VlVh: :RMA: :Gelonin* 

21. A polynucleotide encoding the fusion protein 
of claim 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19 or 20. 

22. A host cell transformed or transfected with 
the fusion protein of claim 21. 
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pos 10 20 30 flO SC 

bind ♦-♦♦-f*+ o^'^-*'*'^'^©*-*--*^ • + •»'^- ooo* + *o+r 

bury 4.--».-*-+ o^.^o + -^,.^+-^--r-*--r--r-^^ ,.-r --.o-^ + ^-ro-*- — « 

mod • . • 

hKl DIQMTOS PSSLSASVGDRVTITCrASQx Is xyLxWYQQKPGkAPklllY aAS 

hK3 EIVLTQS ?gTL51SPGERATLSCRASQS vs 5syLAWyQQK?GQA?RLLI V qaS 

hK2 DIVMTQS PLSLPVTPGZPASISCR3S0S nnYLnWYLCKPGQSpQLLI Y I5S 

h'Ll xSVLTQP PS aSgtPGQrVtlSCsGssS iG xnxVxWYQqlPGtAPKLLI Y n n 

hL2 xSALTQP aS VSGSPGQSiTISCtGt ss Vg ynxVSWyQwhPGkAPK Lly dv 

hL3 SYeLTQP PS vSVsPGQTA ITCsGdx Ix xxyvxWYQQkPgQatPvlVlY d 

fiLg ' nfmitqp ns vsespgktvt iiC' xsxg la sxyvqvyqqrpgsapctvty edn 

hK4 divmtqa pdslavslqerat inckssqs vl knylawy^qkpgqppiclliy was 

hL4 seltqp pa vsvapgqt ritcagdx ig xydaxwyqqxpgqapllviy grr, 

hL5 saitqp ps asgspgqsvt iactgt ss vg xxyvswyqqh g apk I ev 

pos 60 70 80 90 100 

bind -■^00♦'^0+ ■•"♦•+- + -* + + -r^**4- + -n-».-».^ + * O— • -O o*-r-»'*'*'- 

bury -»-'^o+^-o*o- + --^o+o+-^ + ♦4.~« .^-^.^-.^^-r*- 

mod •• ••••*• 

hKl xLxaGVPSRFsGSGSGTx FTlTISsLOpeDf ATYYCqqyxxxP xt FGqGTkv eik 

hK3 3RATGIPdRr SGSgSGTD FTLTISr LEPGOFAVYYCQOYgssP xT FGQGTKv tlK 

hK2 nRaSGVPDRr SGSGSGTD FTLklSRVeAEDVGVYVCMQaiqxP xT FGQGTkx £IK 

hLl BPSGvPDRFSgSKSGTS AsLalaGLqseDeaDYYCacWDaSLd pV FGGGTk TVLg 

hL2 RPSGv RFSGSKSGfiT ASLTIS&i.0aeDEAdYYC5SYxgxxx xV FGgGtKltVlg 

hL3 RPSglPQRFSGS St T ATLTISGvqa DEADYYCqxwDxxx vv FGGGTkLTVLg 

hL6 rpsgvpdrf 3ga33 ns aslLisglktedeadyycqsyosxx wv fgggtkltvlg 

hK4 resgvpdrfsgsgsgtid f tit isslqaqdvavyycqqyystp xt tgqgc^cx gik 

hL4 rpsgipdrf sgsssght asltitgaqaedcadyycnsrdssgk vi fgggtkltvlg 

hL5 rpsgvpdrf sgska nt aaltvs^l aedcadyycssyxxxxx fv fg gtx tv). 



FIG. lOA 
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po* 10 20 30 ' - 40 50 

bind ♦©+ 0'»' ^-r-f o + -»" + + '»'**-*0O-- G00"^♦^0♦ + •^^0*00* 

bury 4-- 1-'+-+•^0'»' o t^u"*^ •'^•o-*-*'^ ♦©'♦'-•-■o~'»-+'t"»'0*o«»-o»-ooT-»-o«-'r 

mod • • . . ... 

hH3 EVQLvESGGG LVqPGGSLRLSCAASGFtr 3 xxxmxWVKQApGKgLEWVxxl xxKXxgx 

hHl QVqLvqSGaE VkKPGxSvxvSCKxSGyyFx xyxixWvRQaPGxGtEWvGxixpxxcxL 

hH2 xvxlxesgpx Ivlptqt 1 1 luct vsgxals xxxvjcwirqppgXxiewIaxix xodd 

po% 60 70 aO 90 100 1*0 

bind -0000*0+01- 1 ©-►-'►o ■^'•'O't'O ■^♦♦•^'^ ■♦'^'•'*"^'*"*'**o*- ---•©"►^"♦^•^•^^ 

bury »o*'*o*"**'*»'0-'*'-"-*0'^o'* •♦■o+—'^ * ^^ooooo**—^- ""O^" 

rood 

hH3 xyadSWGRFTISRddSKNt lyLqMnsLraeOTAvYYCarxwxxxxxxxxWGqGTlVTvSS 

hHl xyapxf qgRVTxcrdxSxntayMeLxsLraeDtAvVYCArxxxxxxxxxxwgqGtlvtVSS 

U\i2 xyxtalrarltiskdtsknqvvlxxxxxdpxdtAtyycarxxxxxxxxdvwgqgttvcvas 
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SH65K-1 

ACT CGT CGA CAC GAT GGA CAT GAG GAC CCC TGC TCA GTT TCT TGG 
CAT CCT CCT ACT CTG GTT TCC AGG TAT CAA ATG TGA CAT CCA GAT 
GAC TCA GT 

HUH~K1 

TGA CTC GCC CGG CAA GTG ATA GIG ACT CTG TCT CCC AGA CAT CCA' 
GAC ATG GAA GAT GAG GAC TGA GTC ATC TGG ATG TC 

HUH-K2 

TCA CTT GCC GGG CGA GTC AGG ACA TTA ATA GCT ATT TAA GCT GGT 
TCC AGO AGA AAC CAG GGA AAT CTC CTA AGA CCC T 

GAT CCA CTG CCA CTG AAC CTT GAT GGG ACC CCA TCT ACC AAT CTG ' 
TTT GCA CGA TAG ATC . AGG GTC TTA GGA GAT TTC C 



HUK''K4 

GGT TCA GTG GCA GTG GAT CTG GGA CAG ATT ATA CTC TCA CCA TCA 
GCA GCC TGC AAT ATG AAG ATT TTG GAA TTT ATT ATT G 

HUH-K5 

GTT TGA TTT CAA GCT TGG TGC CTC CAC CGA ACG TCC ACG GAG ACT 
CAT CAT ACT GTT v»AC AAT AAT AAA TTC CAA AAT CTT C 



HUH-Gl 

TGT CGA CAT CAT GGC TTG GGT GTG GAC CTT GCT ATT CCT GAT GGC 
AGO TGC CCA AAG TGC CCA AGC ACA GAT CCA GTT GGT GCA G 

HUH-G2 

AAG GTA TAC CCA GAA GCT GCG CAG GAG ATT CTG ACG GAC 

CCT CCA GGC TTC TTC AGG CCA GGT CCA GAC TGC ACC AAC TGG ATC T 

HUH-G3 

GCA GCT TCT GGG TAT ACC TTC ACA AAC TAT GGA ATG AAC TGG GTG 
AAG CAG GCT CCA GGA AAG GGT TTA AGG TGG ATG GGC TGG 

HUH-G4 

AAA GAG AAG GTA AAC CGT CCC TTG AAG TC A TCA GCA TAT GTT GGC 
TCT CCA GTG TGG GTG TTT ATC CAG CCC ATC CAC CTT AAA C 

HUH-G5 

GAC GGT TTA CCT TCT CTT TGG ACA CGT CTA AGT GCA CTG CCT ATT 
TAC AGA TCA ACA GCCTCA GAG CCG AGG ACA CGG CTA CAT 

HUH-G6 

*CG AGA CC^ TGA CCG TGG TCC CTT GGC CCC AGA CAT CGA AGT ACC 
ACT CGT AAC CCC GTC TTG TAC AGA AAT ATG TAG CCG TGT CCT CGG C 

H6SG-2S 

ACT AGT GTC GAC ATC ATG GCT TGG GT 



H65-G2 

GAG GAG ACG GTG ACC GTG GT 
H65K*2S 

ACT CGT CGA CAC GAT GCA CAT GAG GAC 

FIG* 11 



JKl-Hlndlll 

GTT TGA TTT CAA GCT TGG TGC 



wo 94/26510 



PCTAJS94/05348 



13/14 



pos 

EU 

hKl 

TAC 

bind 

bury 

mod 

M/H 

prop 

Que 

pos 

EU 

hKl 

TAC 

bind 

bury 

mod 

M/H 

prop 

Que 



DICMTCS 
DIQMTQS 
QIVLTQS 



10 20 30 

PSTLSASVGDBVTITCRASQS IN.. 
PSSLSASVGDRVTITCrASQx Is 

?aimsas?gekvt:tcsasss is 



40 

twlawyqqkpgkapkilmy 

xyLxWYQQKPGkAPklLIY 
YMHWFQQKPG7SPKLWIY 

- — — OOO + O -"^ — — ~ — 



H HM HKM M HH h M 

DIQLTQS PSSMSASPGDRVTITCHASSS IS 
DIQMTQS PSTLSASVGDRVTI7CSASSS IS 



M MM M hM M 

YMHWFQQKPGKSPKtWIY 
YMHWYQQKP GKAP KLL I Y 



60 70 
SLESGVPSRFIGSGSGTE 
xLxsGVPSRFsGSGSGTx 
NLASGVPARFSGSGSGTS 

- ♦OO O * 

+ ♦ o + + - o + o - - * o + o 

• • 9 • • • • 

M M H ^ 
NLASGVPSRFSGSGSGTS 
NLASGVPARFSGSGSGTE 



80 90 
FTLTISSLQPDDFATYYCQQYNSDS 
FT IT I S S LQpe 0 f AT Y YCqqijxxxP 
YSLTISRMEAEDAATYYCHQR5TYP 



50 

aAS 
TTS 

+ ••--»• 

MM 

TTS 

TTS 



- 4- + + + + ^ + + +-f+-0' 



100 

KM FGQGTKV EVK 
xt FGqGTkv eik 
LT FGSGTKL ELK 



MH hMHm h M MMMM 

YTLTISSMQAEDFATYYCHQRSTYP 

FTLTISSLQPDDFATYYCHQRSTYP 



M h M .-n 
LT FGQGTKL ELK 
LT FGQGTICV EVK 
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DCS 10 20 30 40 50 

SU CVQLVCSGAE VKKPGSSVKVSCKASGGTFS RSAI IWRQAPGQGL£WMGGI VPMFG?? 

hHl CVqLvqSGaE VkKPGxSvxvSCKxSGyyFx xyxixWvRQaPGxGLEWvGxixpxxaxt. 

TAG CVOLQQSGA£ lAKPGASVKMSCKASGYTrT S YRMHWVKORPGQGLEWIGYINPSTGYT 

bind o-''0'»'-^'^''0'«* 'r*'^o*-*'*'^''**-*oo-*--- ooo++'^Oi-+'t--o-oo-- 

bury '♦.-■►--•■-•♦■•►o* ■••0'*-o-»'+'*'-'»'-+^'^-'^"0'»— "^ ^-O"**-— — o»■•■++•^o•o•-o•^oo-»••♦'0■»^'^ 

mod • • . • 

M/H M hM m mM M M M M MMM M h m M M M MM M 

crop QVQLQQSGAE VAKPGASVKMSCKASGYTFT 5 YRMHWVKQAPGQGLEW.IGYINPSTGYT 

:Que QVQLVQSGAE VKKPGSSVKVSCKASGYTFT S YRMHWVRQAPGQGLEWIGYINPSTGYr 

pos 60 70 80. 90 100 110 

EU " NYAQKFQGRVTITADESTNTAYMELSSLRSEDTAFYFCAG GYGIY SPEEYNGGLVTVSS 

hHl xyapxfqgRVTxtrdxSxntayMeLxsLrseDtAvYYCArxxxxxxxxxxwgqGt Ivt VSS 

TAG EYNQKFKDKATLTADKSSSTAYMQISSLTFEDSAVYYCAR*GGGV FDYWGQGTTirVSS 

bind -oooo+o+o*»'+0"*'*'»*o++o+o++-*'+*'^++* + -»"*"^+++-o-— --0+ +•♦•■*•+* + 

bury »o"-»-o-'»-*o-'^-'»'0+o+"*'0+-'-»--'+*+*-'»'++-"«'*o-"**-*»ooooo"*-"-**o+-*--*"^ 

mod • •••••••••••••• 

M/H M MMM MhMM MMMmM hmhhh MMMM MMM M 

prop EYNQKFKGKATLTAOKSSSTAYMELSSLRSEDTAVYYCAR GGGV FDYWGQGTTLTVSS • ■ 

Que EYNQKFKDKATITADESTNTAYMELSSLRSEDTAVYYCAR GGGV FDYWGQGTLVTVSS 
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